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MULTIVALENT VACCINE FOR CLOSTRIDIUM BOTULINUM NEUROTOXIN 

FIELD OF THE INVENTION 

The present invention relates to the isolation of polypeptides derived from Clostridium 
hotulimim neurotoxins and the use thereof as immunogens for the production of N accines. 
including multivalent vaccines, and antitoxins. 

BACKGROUND OF THE INVENTION 

The genus Clostridium is comprised of gram-positive, anaerobic, spore-forming bacilli. 
The natural habitat of these organisms is the environment and the intestinal tracts of humans 
and other animals. Indeed. Clostridia are ubiquitous: they are commonly found in soil. dust, 
sewage, marine sediments, decaying vegetation, and mud. \Sec t'..tr.. P.H.A. Sneath et ai. 
"Clostridiumr Hartley 's Manual ^< o f Sy stematic Bacteriolojiy. Vol. 2. pp. 1141-1200. 
Williams & W ilkins ( 1986).| Despite the identification of approximately 100 species of 
( iostridiiim, only a small number have been recognized as eiiologic agents of medical and 
N cicrinar\ imporiancc. Nonetheless, these species are associated with very serious diseases, 
including botulism, tetanus, anaerobic cellulitis, gas gangrene, bacteremia, pseudomembranous 
coliiLs. and clo.siridial gastroenieriiis. Table 1 lists some of the species of medical and 
\eterinary importance and the diseases with which ihcy arc associated. .As virtually all of 
these species have been isolated from fecal samples of apparently healthy persons, some of 
these isolates may be transient, rather than permanent residents of tlie colonic ilora. 



TABLE 1 

Cloxtrhiititpt Species Of Medical And Veterinarx- Imponance* 



Species 


Disease 


( luninovulcricum 


Bacteriuria (pregnant women) 


C *. ars:ctuincn.\c 


infected wounds: Bacteremia: Botulism: Infections of amniotic Huid 


C. tmrutii 


Infected war wounds: Peritonitis: Infectious processes of the eye. ear 
and prostate 


C. hciicrinvkikii 


Infected wounds 


C. hih'rmt'nians 


Infected wounds: Abscesses: Gas Gangrene: Bacteremia 


C. tunutintm 


Food poisoning: Boiuiism (wound, food, infant) 


C. hiuyrwum 


lJrinar\' iraa. lower respirator)' trace, pleural cavity, and abdominal 
infections: Infected wounds: Abscesses; Bacteremia 


C, cudavchs 


Abscesses: Infected wounds 



wo 98/08540 



PCT/US97/15394 



TABLE 1 

Clostridium Species Of Medical And Veterinary Imponance* 



Species 


Disease 


('. t arnis 


Soil tissue infections: Dacicremia 


C. t huiivtivi 


BInckleu 


( \ vlnsindiojtH'mc 


Abdominal, cervical, scrotal, pleural, and other infections: Septicemia; 
F*eriionilis: Appendicitis 


*. . c t /» innif iiini 


Isolated from human disease processes, but role in disease unknown 


(\ ih/fici/c 


Antimicrobial-associated diarrhea: Pseudomembranous enterocolitis: 
Bacteremia: Pyogenic infections 


( '. luilux 


Soft tissue infections 


V. xihnnti 


Soft tissue infections 


('. \il\\nhcnm 


Wound infections: Abscesses: Peritonitis 


( '. IhisfiiarnK' 


Infected war wounds; Birctercinia: Abscesses 


( hiMniyncitm 


Infected war wounds; (Jas gnnirrene: Ciiuiiival pUu|ue isolate 


( \ iihiolts 


Gastrointestinal tract infections 


( ' innncmini 


Gastrointestinal tract mfections: limpyenui 




Penile lesions 


\ . {{.Ill flfll 


Isolated from human disease processes, but role in disease unknown. 


C littuisum 


Bacteremia: Peritonitis: Pulmonary inlections 


( \ tmiiK'nittnuwium 


Various infectious processes 




Infected wounds: Gas uangrene: lllackleu. Bit: head (ovine): Rcdwnicr 
disease (bovine) 


( \ ttroiiaim 


IJrinarv trnct infi^ctionc' RL*ft:il :ihvri*«:ci*c 


( ' fHtriffw/nfiaim 


Bacteremia: Peritonitis; Infected wounds: Appendicitis 


( iKTfrin\>vns 


(ias gangrene: Anaerobic cellulitis: Intra-abdominal abscess*es: Sofi 
(i^Mit, inidriiun^. 1 oou poisonmi!. tNecroiizm^ pneumonini Lmpvema: 
Meningitis; Bacteremia: Uterine Infections: tnterilis necrotans: Lamb 
dysentery: Struck; Ovine Entcroio.\emia: 


( '. fnutvfucicm 


Bacteriuria (Preunanl women with bncteremin) 


( pmrt/icmH 


Abscesses: Infected wounds: Bacteremia 


('. runto.stiin 


Infections of the abdominal cavity, uenital tract, lunji. and biliary tract; 
Bacteremia 


( sunu\z<if(*niw 


Isolated from liuman disease processes, but role in disease unknown. 


(\ scpficiun 


Gas gangrene; Bacteremia: Suppurative inlections; Necroiizitii: 
enterocolitis: Bra.xy 


C. sonicllii 


Gas !inni»rene: Wound infections: Penile lesions; Bacteremia: 
Abscesses: Abdominal and vajiinni inteciions 
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TABLE I 



( lostf'fdium Species OF Medical And Veterinary Importance* 



Species 


Disease 


( *. splicnokhs 


Appendicitis: Bacteremia: Bone and sot'i tissue infections: 
Intraperitoneal injections: Infecicd war wounds: Visceral aas gangrene: 
Renal abscesses 


( '. sptn ttiicncs 


Gas gangrene: Bacteremia: Endocarditis: central nervous system and 
pleuropulmonary infections: Penile lesions: infected war wounds: 
Other pyogenic infections 


( '. snhwrnunalc 


Bacteremia: Empyema; Biliary tract, soft tissue and bone infections 


i '. synihittsum 


Liver abscesses: Bacteremia: Infections resuhint; due to liowcl Morn 




Gas ganyrene: Appendicitis: Brain abscesses: Intestinal tract and soft 
tissue infections; Iiitectcd wnr wounds: i'eriodoniiiis: Bacteremia 


h'/uiii 


Tetanus: Infected gums and teeth: Corneal ulcerations: Mastoid and 
middle ear infections: Intraperitoneal infections; feianus neonatorum; 
Postpartum uterine infections: Soft tissue infeciions. especially related 
lo irnumn (including abrasions ai>d lacerations): Infections related to 
use of contaminated needles 


( '. ihcrmtisavvhurolvncum 


Isolated from hum«nn disease processes, but role in disease unknown. 



C iinipilcd from I'.G. Engelkirk c7 ol. 'Xlu.s.stjkahntr\ Phnciphw unti nntchcc nf Clinical Anucmhic 
nttiunntosix, pp. 22-2.1. .Star Publishing Co.. Belmont. CA (I902): J. Stephen and k.A. IVtrowski. 
"Itnins W hich Ti averse Afcnihnttws am/ Dcrcf^u/uw Cc/ls." in liacicrial Toxins, 2d ed.. pp. 66-67. 
American Society for Microbiology (1986); K. Berkow and A.J. I letchcr (eds.j. Ihuwrni/ l)isciiH's\" 
MctcA Manual nf Diagnosis and Thcta/iy, I6ih ed.. pp. 1 16-126. Merck Research Laboratories, Rahway. 
N.J. (1902): and O IL Sigmund and CM. Fraser (eds.). Xlostridial Infeciions/' Ma\k \ cfchnan 
MitnmtL 5ih ed.. pp. 396-409. Merck & Co., Rahway. N.J. (1979). 

In most cases, the pathogenicity of these organisms is related to the release ol' power! ul 
exotoxins or highly cle.siruciive enzymes. Indeed, several species of the genus ( Instridium 
produce toxins and other enzymes of great medical and veterinary significance. |C,L. 
Ilatheway. t'lin. Microbiol. Rev. 3:66-98 (I990),J 

Perhaps because of their significance for human and veterinary medicine, much 
research has been conducted on these toxins, in particular those ofC. hntulinum and C 
iliflkih, 

C. hotutmum 

Several strains o{' Clostriilinm hofulinum produce toxins of signilkancc to human and 
animal health. |C,L. Halhcway. Clin. Microbiol. Rev. .3:66-98 (1990)| The etrccls of these 
toxins lange from diarrheal di.scases that can cause destruction of the colon, to paralytic 
effects that can cau.se death. Particularly at ri.sk for developing clostridial di.scases are 
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neonates and humans and animals in poor health (c.^c., those suHering from diseases 
associated with old age or immunodeficiency diseases). 

ClostrUUwn boiulmum produces the most poisonous biological toxin known. The 
lethal human dose is a mere 10" mg/kg bodywcight for loxin in the bloodstream. Botulinal 
5 loxin blocks nerve transmission to the muscles, resulting in tlaccid paralysis. When the loxin 
reaches airway and respiratory muscles, it results in respiratory failure ihai can cause death. 
|S. Anion. J. Infect. Dis. 154:201-206 ( 1986) | 

( hotulhwm spores are carried by dust and arc found on vegetables lukcii trom the 
soil, on fresh fruits, and on aiiricultural products such as honey. Dndcr conditions favorable 

10 lo the organism, the spores germinate to vegetative cells which produces loxin, [S. Arnon. 
Ann. Rev. Med. 31:541 (1980)1 

Botulism disease may be grouped into four types, based on the method of introduction 
of lo.xin into the bloodstream. Food-borne botulism results IVom ingesiinLi impropcrlv 
preserved and inadequately heated food that contains botulinal lo.xin. There were 355 cases of 

15 food-borne boiulism in the United Slates between 1976 and 1984. [K.L. MacDonald cv ai. 

Am. .1. I-ipidemiol. 124:794 (1986).] The death rale due lo botulinal loxin is 12% and can be 
hi<!her in particular risk uroups. fC\0. Tackei et uL, Am. .1. Med. 76:794 ( 1984). j Wound- 
induced botulism results from C. hoii/linum penetrating traumatized tissue and producing toxin 
that is absorbed into the bloodstream. Since 1950. thirty cases ol" wound boiulism have been 

20 reported. |M.N. Swart/, '\\ncwrohic Sporc-i'ormm\i Iktcilli: The ('/(fsfridia." pp. 633-646. in 
li.D. Davis tv <//..(eds.). Micrnhiolo^y. 4ili edition. J.U. I.ippincoll C o. (1990).| Inhalation 
botulism results when the toxin is inhaled. Inhalation botulism has been reponed as the result 
of accidental exposure in the laboratory |n. llob.er. Med. Klin. 41:1735 fl962lj and could 
arise if the loxin is u.sed as an agent of biological warfare |I).R. I'ranz a uL. in Botulinimi 

25 ami Tvtcmns Xcuroioxins, MM, Da.sGupta. ed.. Plenum Press. New York (1993). pp. 473.476]. 
Infectious infant botulism results from ('. boiulmum colonization of the infant intestine with 
production of toxin and its absorption into the bloodstream. It is likely that the bacterium 
gains entry when spores are ingested and subsequently germinate. |S. Arnon. J. Infect. I3is. 
154:201 ( 1 986). I There have been 500 cases reported since it was first recognized in 1976. 

30 I M.N. Swartz. supra.] 
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Infant botulism strikes infants who are three weeks to eleven montiis old (greater than 
90% of the cases are infants less than six months). (S. Arnon. J, Infect. Dis. 154:201 (I986).i 
It is believed that infants are su.sccptible. due, in large pan. to the absence of the full adult 
complement of inte.stinal microllora. The benign microflora present in the adult intestine 
IJrovide an acidic environment that is not favorable to colonization by C. hoiu/immi. Infants 
begin life with a sterile intestine which is gradually colonized by microflora. Because of the 
limited microflora present in early infancy, the intestinal environment is not as acidic, 
allowing for C. bonilhnim spore germination, growth, and to.xin production. In this regard, 
some adults who have undergone antibiotic therapy which alters intestinal microllora become 
more su.sceptible to botulism. 

An additional factor accounting for infant susceptibility to infectious botulism is the 
immaturitx of the infant immune system. The mature immune .system is sensitized to 
bacterial antigens and produces protective antibodies. .Secretory IgA produced in the adult 
intestine has the ability to agglutinate vegetative cells olC. hoitilimini. |S. Arnon. J. Infect. 
Dis. 154:20! ( I986).| Secretory IgA may al.so act by preventing intestinal bacteria and their 
products from crossing the cells of the intestine. [S, Arnon. Epidemiol. Rev. 3:4.^ (I98l).| 
The infant immune .sy.stem is not primed to do this. 

Clinical symptoms of infant botulism range from mild paralysis, to moderate and 
severe paral\sis requiring hospitalization, to fulminant paralysis, leading to sudden death. |.S. 
.Arnon. lipidcmiol. Rev. 3:4.5 (1981).] 

The chief therapy for se\ ere infant botulism is ventilatory assistance using a 
mechanical respirator and concurrent elimination oftoNin and bacteria using cathartics, 
enemas, and gastric lavage. There were 68 hospitalizations in California lor infant botulism 
in a single x car with a total cost of over $4 million for treatment. \TX.. I-ranko\ ich and .S. 
Arnon. West. .1. Med. 154:103 (l')91).| 

Different strains of ClosirkUimi hoiiilinum each produce antigenically distinct toxin 
designated by the letters A-G. Serotype A toxin has been implicated in 26% of the eases of 
food botulism; types B. E and F have also been implicated in a .smaller percentage of the food 
botulism ca.ses (II. Sugiyama. Microbiol. Rev. 44:419 ( 1980)]. Wound botulism has been 
reportedly caused by only types A or B to.xins fH. Sugiyama. siii>rci\. Nearly all ca.ses of 
infant botulism have been caused by bacteria producing either type A or type B to.xin. 
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(Exceptionally, one New Mexico case was caused by Closividium hofttlinum producing type F 
loxin and another by Clostridium hofulinum producing a type B-lype F hybrid.) fS. Arnon. 
Epidemiol. Rev. 3:45 (I98]).| Tyfic C loxin alTccls waiert'owl. callle. horses and mink. Type 
I) loxin alTccis caiile. and type E loxin alTecis boih humans and birds. 
5 A irivalenl antitoxin derived from horse plasma is commercially available from 

Connaughl Industries Ltd. as a therapy for toxin types A. B. and |{. However, the antitoxin 
has several disadvaniaiies. First, extremely large dosages must be injected inira\'enousl\' 
und/or intramuscularly. Second, the antitoxin has serious side elTecis such as acute 
anaphylaxis which can lead to death, and serum sickness, l inally. the ertlcacy ot the 
10 antitoxin is uncertain and the treatment is costly. |C.(). Jacket t'/ oL, Am. J. Med. 76:794 
{I084).l 

.\ heptavalent equine botulinal antitoxin which uses only the F(ab*)2 portion of the 
antibody molecule has been tested by the United States Military. (M. Balady. IJSAMRDC 
Newsletter, p. (> ( IW|).j This was raised aiiain.si impure toxoids in those large animals and is 

1.^ not a hiuh liter preparation. 

.'\ peniavaleni human antitoxin has been collected from inimunized human subjects for 
use as ;i treatment for infant botulism. The supply of this anlitoxin is limited and cannot be 
expected to meet the needs of all individuals stricken with botulism disease. In addition, 
collection ol' human sera must involve screening out IlIV and other potentially serious human 

20 pathogens. Schwarii and S.S. Arnon. Western .!. Med. 156:197 (IW2).l 

Inlani botulism has been implicated as the cause of mortality in some cases of Sudden 
Infant Death Syndrome (SIDS. aLso known as crib death). SIDS is oftlcially recognized as 
infant death that is sudden and unexpected and that remained unexplained despite complete 
post-mortem examination. I he link oi* SIDS to infant botulism came when fecal or blood 

25 specimens taken at autopsy I'rom SIDS infants were found to contain hoiulinum organisms 
and/or toxin in 3-4% of cases analyzed. [D.R. Peterson cf at.. Rev. Infect. Dis. 1:630 
(1^)79).] In contrast, only I of 160 healthy infants (0.6%) had L\ hatidintim organisms in the 
feces and no botulinal loxin. (S. Arnon at ai. Lancet, pp. 1273-76. .lune 17. 1078.) 

In developed countries. SIDS is the number one cause of death in children between 

.^0 one month and one year old. {S. Arnon ci uL. Lancet, pp. 1273-77. .lunc 17. 1978.) More 
children die from SIDS in ihe first year than from any other single cause of death in the first 
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Iburteeh years ol" life. In the United States, iliere are 8.000-10.000 SIDS victims annually. 

/(/. 

What is needed is an effective therapy against infant botulism that is free of dangerous 
side effects, is available in large supply at a reasonable price, and can be safely and gently 

• 5 delivered so that prophylactic application to infants is feasible. 

Immunization of subjects with toxin preparations has been done in an attempt to 
induce immunity against botulinal toxins. A C*. hotuliniim vaccine comprising chemically 
inactivated {i.e.. formaldehyde-treated) type A. B. C. D and F toxin is commercially available 
for human usage. Uo\vc\er. this vaccine preparation has se\eral disadvantages. First, the 

10 efficacy of this vaccine is variable (in particular, only 78% of recipients produce protective 
levels of anti-lype 13 antibodies following administration of the primary scries). Second, 
immunization is painlul (deep subcutaneous inoculation is required for administration), with 
adverse reactions being common (moderate to severe local reactions occur in approximateh' 
(>% of recipients upon initial injection: this number rises to approximately 1 1% of individuals 

\5 who receive k)osier injections) [Informational Brochure for the Peniavalent (AUCDF:) 
Uolulinum foxoid. Centers for Disea.se Control]. Third, preparation of the \accine is 
dangerous as active toxin must be handled by laboratory workers. 

What is needed arc siifc and effective vaccine preparations for administration to those 
at risk of exposure to ( '. hotulimuu toxins. 

20 

C\ difftcile 

( ' (li/ficik\ an organism which gained its name due lo difficulties encountered in its 
isolation, has recently been proven to he an etioloiiic agent of diarrheal disease. (.Sneaih a 
ui. p. 1 Ui5.). ('. diffk'ilv is present in the gastrointestinal tract of approximately 3% of 

25 healthy adults, and )0-3()% of neonates without adverse effect (.Swariz. at p. 644): by other 

estimates. C. difficile is a part of the normal gastrointestinal tlora of 2-10% ol* humans. |G.F. 
Brooks cv (//.. (eds.) 'Infcvtinns i 'aused by Anacrohic ISacicria" Jawviz, Melnick. <i 
Ailclher*i 's Medical \'licrohiolo^\\ 19th cd.. pp. 257-262. Appleton & Lange. San Matetv CA 
(1W|).| As these organisms are relatively resistant to most commonl) used antimicrobials. 

30 when a patient is treated with antibiotics, the other members of the normal gastrointestinal 
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flora are suppressed and C: Uifficile flourishes, producing cyiopaihic toxins and cmeroioxins. 
Ii has been found in 25% of cases of moderate diarrhea resulting from treatment with 
antibiotics, especially the cephalosporins, clindamycin, and ampicillin. [M.N, Swariz at 644.] 

Importantly. C. difficile is commonly associated with nosocomial infections. The 
organism is often present in the hospital and nursing home environments and may be carried 
on the hands and clothing of hospital personnel who care for debilitated and 
immunocompromised patients. As many of these patients arc being treated with 
antimicrobials or other chemolherapeutic agents, such transmission of Jiff idle represents a 
significant risk factor for disease. (Engelkirk ei a/,, pp. 64-67.) 

C, difficile is associated with a range of diarrheiic illness, ranging from diarrhea alone 
to marked diarrhea and necrosis of the gastrointestinal mucosa with the accumulaiion of 
inflammatory cells and fibrin, which forms a pseudomcmbrane in the affected area. (Brooks 
cv (//.) It has been found in over 05% of pseudomembranous enterocolitis cases. {Swariz. at 
p. 644.) This occasionally lalal disease is characterized by diarrhea, multiple small colonic 
plaques. :md toxic megacolon. (Swariz. at p. 644.) Although .stool cultures are sometimes 
used lor diagnosi.s. diagnosis is be.st made by detection of ihe heat labile toxins present in 
fecal iilirates from patients with enterocolitis due to C. difficile. (Swartz. at p. 644-645: and 
Brooks cf ui, at p. 260,) i \ difficile to.vins are cytotoxic lor tissue/cell cultures and cause 
enterocolitis when injected iniracccally into hamsters. (Swariz, ai p. 644.) 

I he enterotoxiciiy of difficile is primarily due to the action of two toxins, 
designated A and U. each of approximately 300.000 in molecular weight. IJoth are potent 
cyioio.xins, with to.xin A possessing direct cnierocytotoxic activity. |I.yerly et af. Infect. 
Immun. 60:4633 (1^)92).| Unlike toxin A of C. pcrfrin^ens. an organism rarely associated 
with antimicrobial-associated diarrhea, the toxin of C. difficile is nt)t a spore coat constituent 
and is not produced during sporulation. (Swartz. at p. 644.) C difficile toxin A causes 
hemorrhage, fluid accumulation and mucosal damage in rabbit ileal loops and appears to 
increase the uptake of toxin B by the intestinal mucosa. Toxin I) does not cause intestinal 
fluid accumulation, but it is 1000 times more toxic than toxin A to tissue culture cells and 
causes membrane damage. Although both toxins induce similar cellular effects such as actin 
disaggregation, differences in cell specificity occurs. 
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Both toxins are important in disease. fBorriello ei ciL. Rev, Infect. Dis.. I3(suppl. 
2):S185 (1990): Lyerly ei ciL. Infect. Immun.. 47:349 ( 1985): and Rolfe, Infect. Immun.. 
59:1223 (1990).) Toxin A is thought to act first by binding to brush border receptors, 
dcstrov ing the outer muco.sal layer, then allowing toxin B to gain access to the underlying 
tissue. These steps in pathogenesis would indicate that the production of neutralizing 
antibodies against toxin A may be sufficient in the prophylactic therapy of CDAD. However, 
antibodies against toxin B may be a necessary additional component for an effective 
therapeutic against later stage colonic disease. Indeed, it has been reported that animals 
require antibodies to both toxin A and toxin B to be completely protected against the disease, 
I Kim and Rolfe. Abstr. Ann. Meet. Am. Soc. Microbiol.. 69:62 (1987).) 

( iti/fkUc has also been reported to produce other toxins such as an enteroioxin 
different from toxins A and B |Banno ci al.. Rev. Infect. Dis.. 6(Suppl. 1:S1 1-S20 (i984)|. a 
low molecular weight toxin |Rihii ci if/.. Biochem. Biophys. Res. Comm.. 124:690-695 
{ I9X4)|. a motility altering laclor [Justus ci a/.. Gaslroemcrol.. K3:X36-843 (198:)|. and 
perhaps other toxins. Regardless. ( Jif/ici/c gastroinicsiinal disease is of primary concern. 

Il is signillcant that due to its resistance to mosi commonly used antimicrobials. (\ 
iliffidic is associaicd with amimicrobial therapy with virtually all antimicrobial agents 
(although most commonly ampicillin. clindamycin and cephalosporins). It is also a.ssociated 
with disease in patients undcruoing chemotherapy with such compounds as methotrexate. 5- 
llu(>rour:iciL cyclophosphamide, and doxorubicin, (S.M. Finegold a uL, Clinicul ihnile to 
Anacrohic Injcaions, pp. 88-89. Star Publishing Co.. Belmont. CA (1992).] 

I rcaimeni ol (*. Jif/icilc disea.se is problematic, given the high resistance of the 
organism. Oral metronidazole, bacitracin and vancomycin have been reported to be effective, 
(f inegold ct t/A. p. 89.) However there are problems as.sociated with treatment utilizing these 
compounds. Vancomycin is very expensive, some patients are unable to take oral medication, 
and the relapse rate is high (20-25%). although it may not occur for .several weeks. Id 

C. i/ifikile disease would be prevented or treated by neutralizing the effects of these 
toxins in the gastrointestinal tract. Thus, what is needed is an effective therapy against ( \ 
illfficilv toxin that is free of dangerous side effects, is available in large supply at a reasonable 
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price, and can be safely delivered so that prophylactic application lo patients ai risk of 
developing pseudomembranous enterocolitis can be elTectivcly treated. 

DESCRIPTION OF THE DRAWINGS 

I'igiire I shows the reactivity oranli-C' hotuHnum IgY by Western blot. 
I'igure 2 shows the IgY antibody titer to (' hoiiilinum type A toxoid in eggs, measured 
by lil.lSA. 

l-igure .1 shows the results ofC. iiijf/ici/e toxin A neutralization assays. 

Figure 4 shows the results ofC difficile loxin B neutralization assa\s. 

I'igurc 5 shows the results of C \ difficile loxin B neutralization assays. 

l igure 6 is a restriction map of C. difficile loxin A gene, showing sequences of 
primers 1-4 {SEQ ID NOS:l-4). 

l igure 7 is a Western bloi of ('. difficile toxin A reactive protein. 

l-igure X shows ( difficile loxin A expression consirucis. 
I> l igure 0 shows (' difficile loxin A expression constructs. 

f igure 10 shows the purillcalion of recombinant (*. difficile toxin A. 

figure 1 1 shows the results of ('. difficile loxin A neuiralizaiion assays with antibodies 
reactive lo recombinani loxin A. 

figure 12 shows the results for a ('. difficile toxin A neutralization plate. 
20 f igure 13 shows the results for a C. difficile toxin A neutralization plate. 

figure 14 shows the results of recombinant difficile toxin A neutralization a.ssays. 

figure 15 shows ('. difficile toxin A expression constructs. 

figure 16 shows a ehromaiograph plotting absorbanee at 280 nm against retention time 
lor a pMAl 870-680 IgY PIXJ preparation. 
25 I'igure 1 7 shows two recombinant ( difficile toxin B expression constructs, 

f igure 18 shows C. difficile toxin B expression constructs, 
l-igure M) shows (*. difficile toxin B expression constructs, 
l-igure 20 shows difficile loxin B expression constructs. 

figure 21 is an SDS-F'ACiL gel showing the purillcation of recombinani ( *. difficile 
.>0 toxin H fusion protein. 
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Figure 22 is an SDS-PAGE gel showing ihe purification of two hislidinc-iagged 
recombinant C\ difficile toxin B proieins. 

Figure 23 shows (\ difficile toxin B expression constructs. 

Figure 24 is a Western blot of (*. difficile toxin H reactive protein. 
5 Figure 25 shows ( ' botnlmum type A toxin expression constructs: constructs used to 

prov ide r. houflimim or difficile sequences are also shown. 

Figure 26 is an SDS-PAGH gel stained with Coomaisse blue showing the purification 
ol' recombinant botulinum type A toxin fusion proteins. 

Figure 27 shows (' hotulimm ij pe A toxin expression constructs: constructs used to 
10 provide (' hoiulinum sequences are also shown. 

I'igurc 28 is an SDS-PAGF gel stained with Coomaisse blue showing the purification 
ofpllisBoi protein using the Ni-NTA resin. 

l-iuurc 2^) is an SDS-PAOH gel stained with Coomaisse blue showing the expression of 
pliisBot protein in UI.21(DC3) and BL21{DF:3)pLysS host cells. 
1^ I igure 30 is an SDS-PAGF gel stained with Coomaisse blue showing the purification 

ol pllisBoi protein using a batch absorption procedure. 

1-igure 31 is an SDS-I\'\GF gel siaincd with Coomaisse blue showing the purification 
of pl lisBt)i and pi lisBot( native) proteins using a Ni-N TA column. 

r'iuure 32 is an SDS-PAGF gel stained with Coomaisse blue showing the purification 
20 ol pllisBoiA protein expressed in pllisBotA(syn) kan laclq T7/pACYCGro/B1.2 UDE3) cells 
using an IDA column, 

I'igurc 33 is an SDS-PAGH gel sUiined wilh Coomaisse blue showing ihe purification 
ol pllisBoiA. pHisBotB and pllisBotf: proteins by IDA chromatography followed by 
chromatography on S-lOO to remove folding chapcrones. 
25 I'iuure 34 is an SDS-PAGF gel stained wilh Coomaisse blue showing the extracts 

derived from pHisBoiB amp T7lac/BL2l(DE3) cells before and after purification on a Ni- 
NTA column. 

Figure 35 is an SDS-I*AGF gel run under native conditions and stained with 
Coomaisse blue showing the removal of folding chapcrones iVom IDA-purified BotB protein 
30 using a S-lOO column. 
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Figure 36 is an SDS-PAGI: gel slained with Coomaisse blue showing proteins thai 
cluled during an imidazole step gradient applied lo a IDA column containing a l\ saie of 
pHisBotB kan laclq T7/pACYCGro/BL21(DE:>) cells. 

l igure 37 is an SDS-PAGE gel run under native conditions and stained with 
Coomaisse blue showing IDA-puritlcd BolB protein before and after ultraniiraiion. 

Figure 38 is an SDS-PAGE gel stained with Coomaisse blue showing ihc purification 
nl" BoiE protein using a NiNTA column. 

l-igure 39 is an SDS-PAGE gel stained with Coomaisse blue showing extracts derived 
from pIlisBotA kan T7 lac/BL2l(DE3) pLysS cells grown in fermeniation culture. 

Figure 40 is a chromatogram showing proteins present after IDA-purilicd Boil: protein 
was applied to a S-lOO column. 

DEFINITIONS 

To faciliiaie understanding of the invention, a number of terms arc dcdncd below. 

As used herein, the term "neutralizing" is used in reference lo antitoxins, pariiculariy 
antitoxins comprising antibodies, which have the ability to prevent the pathological actions of 
the toxin against which the antitoxin is directed. 

As used herein, the term "overproducing" is used in reference lo the production of 
clostridial loxin polypeptides in a host cell and indicates that the host cell is producing more 
i>f ihc clo.siridial toxin by virtue of the introduction of nucleic acid sequences encoding said 
clostridial toxin polypeptide than would be expressed by said host cell absent the introduction 
ol said nucleic acid sequences. To allow ease of purincaiion of toxin polypeptides produced 
in a host cell it is preferred that the host cell express or overproduce siiid toxin polypeptide at 
a level greater than 1 mg/liter of host cell culture. 

".A hosl cell capable of expressing a recombinant protein al a level greater than or 
equal to 5% of the total cellular proiein" is a host cell in which the recombinant proiein 
represents at leasi 5% of the total cellular proiein. To determine what percentage ol' total 
cellular protein the recombinant protein represents, the following steps are taken. A total of 
10 ()D,„„. units of recombinant ho.st cells (e..v.. 200 m1 of cells al OD,,,, 50/mli are removed 
(at a limepoinl known lo represent the peak of expression of the desired recombinant proiein) 
to a 1.5 ml microfuge lube and pelleted for 2 min at maximum rpm in a microfuge. I'he 
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pelleis are resuspended in I ml of 50 mM NaHPO,, 0.5 M NaCl. 40 niM imidazole buffer 
(pH 6.8) containing 1 mg/ml lysozyme. The samples arc incubated for 20 min at room 
temperature and stored ON at -IQ^'C. Samples are thawed completely at room temperature 
and sonicated 2X10 seconds with a Branson Sonifier 450 microtip probe al U 3 power 
setting. The samples are centrifuged for 5 min. at maximum rpm in a microfuge. An aliquot 
(20 |.il) of the protein sample is removed to 20 pi 2X sample buffer (this represents the total 
protein extract). The samples are heated to 95X for 5 min. ihen cooled and 5 or 10 nl are 
loaded onto 12.5% SDS-PAGE gels. High molecular weight protein markers are also loaded 
10 allow for estimation of the MW of identified recombinant proteins. After electrophoresis, 
proiein is delected generally by staining with toomassie blue and the stained gel is scanned 
using a densitometer to determine the percentage ol" protein present in each band. In this 
manner, the perceniage of protein present in the band corresponding to the recombinant 
protein ol" interest may be determined. It is not necessary that C'oomassic blue be emplovcd 
tor the detection of proiein. a number of fluorescent dyes (c'.,t:,. Sypro orange S-6651 
(Molecular Probes, liugene, OR) may be employed and the stained gel scanned using a 
lluoroimagcr jc-.i:.. I'luor Imager SI (Molecular Dynamics. Sunnyvale. CA)|. 

"A hnsi cell capable of expressing a rccombinani proiein as a soluble proiein al a level 
j^rcaicr ihan or equal to 0.25% of the loial soluble cellular proiein" is a hosi cell in which the 
amount of soluble recombinant protein present represents al least 0.25% of the total cellular 
protein. As used herein "total soluble cellular protein" refers to a clarified PFJ lysate 
prepared as described in H.xample 3l(c)(iv). Brielly. cells are harvested following induction 
i>r expression of recombinant protein (nt a poinl ol' maximal expression). The cells are 
resuspended in cell resuspension buffer (CRB: 50 mM NaPO^. 0.5 M NaCI. 40 mM 
imidazole, pi I 6.8) lo create a 20% cell suspension (wet weight of cellsA olume of CRB) and 
cell lysaies arc prepared as described in E.xample 3l(c)(iv} (/.c. sonicaiion or homogenizaiion 
Ibllowed by centrifugation). The cell lysale is then flocculated utilizing polyethyleneimine 
(PF:I) prior to centrifugation. PEI (a 2% solution in dH.O. pH 7.5 with IICl) is added to the 
cell ly.sate to a final concentration of 0.2%. and stirred Ibr 20 min at room temperature prior 
to centrifugation |8.50() rpm in JAM) rotor (Beekman) for 30 minutes at 4%'\, This treatment 
removes RNA. DNA and cell wall coinponents. resulting in a clarilled. low vi.scosity lysate 
("Pi:{| clarified lysale"). The recombinant proiein present in the PKI clarit'ied lysate is then 
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piiritlcd (c..c:., by chromatography on an IDA column for his-iaggcd proteins). The amount of 
purified recombinant protein (/.t\. the eluted protein) is divided by the concentration of 
protein present in the PL-I clarified lysatc (typically 8 mg/nil when using a 20% cell 
suspension as the starting material) and muhiplicd by 100 to determine what percentage of 
total soluble cellular protein is comprised of the soluble recombinant protein (see Hxample 
33b). 

As used herein, the term "llision protein" refers to a chimeric protein containing the 
protein of interest {i.e., ( \ hotulinum loxin A, B. C. D. 0. h. or Ci and fragments thereof) 
joined to an exogenous protein fragment (the fusion partner which consists of a non-toxin 
protein). The fusion partner may enhance solubility of the (' hondinum protein as expressed 
in a ho.si celL may provide an affinity tag to allow purification ol* the recombinant fusion 
protein from the host cell or culture supernatant, or both. If desired, the fusion protein may 
be remo\cd from the protein ol' interest [i.e., loxin protein or fragments thereof) prior to 
immunization by a variety of enzymatic or chemical means known to the art. 

/\s used herein the term "non-toxin protein" or "non-toxin protein sequence" refers to 
thai portion oi a fusion protein which comprises a protein or protein set|uencc w hich is not 
derived from a bacterial toxin protein. 

Hie term "protein of interest" as used herein refers to the protein whose expression is 
desired within the fusion protein. In a fusion protein the protein of interest will be joined or 
fused vNith another protein or protein domain, the fusion partner, to allow tor enhanced 
stability of the protein of interest and/or ease of purification of the fusion protein. 

.\s u.sed herein, the term "malto.se binding protein" refers to the maltose binding 
protein of A'. a#//. A portion ol' the maltose binding protein may be added to a protein of 
interest to generate a fusion protein: a portion of the mallo.se binding protein may merely 
enhance the solubility of the resulting fusion protein when expressed in a bacterial host. On 
the other hand, a portion of the maltose binding protein may allow affinity purification of the 
fusion protein on an amylose resin. 

As used herein, the term "poly-hisiidinc tract" when used in reference to a fusion 
protein refers to the presence of two to ten hislidine residues at either the amino- or carboxy- 
terminus of a protein of interest. A poly-hislidine tract of six to ten residues is preferred. 
The poly-histidine tract is also defined functionally as being a number of con.secuiive histidine 
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residues added to the protein of interest which allows the affinity purification of the resulting 
fusion protein on a nickel-chelatc or IDA column. 

As used herein, the term "purified" or "to purify" refers to the removal of 
contaminants from a sample, l-or example, antitoxins arc purified by removal of 
contaminating non-immunoglobulin proteins: they are also purified by the removal of 
immunoglobulin that does not bind loxin. The removal of non-immunoglobulin proteins 
and/or the removal of immunoglobulins that do not bind toxin results in an increase in the 
percent ol' toxin-reactive immunoglobulins in the sample. In another example, recombinant 
toxin polypeptides are expressed in bacterial host cells and the toxin polypeptides arc purified 
by the removal of host cell proteins: the percent of recombinant toxin polypeptides is thereby 
increased in the sample. Additionally, the recombinant toxin polypeptides are purified by the 
removal ol* host cell components such as lipopolysaccharidc (t'-x-. endotoxin). 

The term "recombinant DNA molecule" as used herein refers to a DNA molecule 
which is comprised of segments of DNA joined together by means ol' molecular biological 
techniques. 

I'he term "recombinant protein" or "recombinant polypeptide" as used herein refers to 
a protein molecule which is expressed from a recombinant [>NA molecuic. 

The term "native protein" as used herein refers to a protein which is isolated from a 
natural source as opposed to the production of a protein by recombinant means. 

As used herein the term "portion" when in reference to a protein (a.s in "a portion ol*a 
given protein") refers to iVagmenis of that protein. The fragments may range in si/e from 
lour amino acid residues to the entire amino acid sequence minus one amino acid. 

As used herein "soluble" when in reference to a protein produced by recombinant 
DNA technology in a host cell is a protein which exists in solution in the cytoplasm of the 
host cell: if the protein contains a signal sequence the soluble protein is exported to the 
periplasmic space in bacteria hosts and is secreted into the culture medium in eucaryolic cells 
capable of secretion or b\ bacterial host possessing the appropriate genes (i.e.. the kil gene). 
In contrast, an insoluble protein is one which exists in denatured form inside cytoplasmic 
granules tcalled inclusion bodies) in the host cell. High level expression {i.e., greater than 10- 
20 mg recombinant protein/liter of bacterial culture) of recombinant proteins often results in 
the expressed protein being found in inclusion bodies in the bacterial host cells. A soluble 
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protein is a protein which is not found in an inclusion body inside the host cell or is found 
both in the cytoplasm and in inclusion bodies and in this case the protein may be present at 
high or low levels in the cytoplasm. 

A distinction is drawn between a soluble protein {i.e. a protein which when expressed 
in a host cell is produced in a soluble form) and a "solubilized" protein. An insoluble 
recombinant protein found inside an inclusion body may be solubilized {i.e. rendered into a 
soluble form) by treating purified inclusion bodies with denaiurants such as guanidine 
hydrochloride, urea or sodium dodecyl sulfate (SDS). These denaiurants must then be 
removed irom the solubilized protein preparation to allow the recovered protein to rcnaiurc 
(refold). Not all proteins will refold into an active conformation after solubilization in a 
denaturant and removal of the dcnaturani. Many proteins precipiiaic upon removal of the 
denaturant. SDS may be used to solubilizc inclusion bodies and will maintain the proteins in 
solution at low conceniraiion. However, dialysis will not always remove all of the SDS (SDS 
can form micelles which do noi dialyze out): therefore. SDS-solubilized inclusion bodv 
protein is soluble but not refolded, 

A distinction is drawn between proteins which arc soluble ( i.e.. dissolved) in a 
solution devoid of signiilcanl amounts of ionic detergents SDS) or denaiurants (c'.,i,'.. 
urea, guanidine hydrochloride) and proteins which exist as a suspension of insoluble protein 
molecules dispersed within the solution. A soluble protein will not be removed from a 
solution containing ihe proiein by cenirifugalion using conditions sufncieni lo remove bacteria 
present in a liquid medium [i.e., cenirifugation at 12.()()() \ u for 4-5 minutes), l-or example. 
10 lest whether two proteins, protein A and protein 13. are soluble in sokiiion, ihe two proteins 
are placed into a solution selected from the group consisting of PUS-NaC I (IMJS containing 
0-5 M NaCl). PBS-NaCI containing 0.2% Tween 20. PBS. PBS containing 0.2% Twecn 20. 
PBS-C (PBS containing 2 niM CaClO. PBS-C containing either 0.1 or 0.5 % Tween 20. PBS- 
C containing either 0.1 or 0.5% NP-40. PBS-C containing either 0.1 or 0.5% Triton X-IOO. 
PBS-C containing 0.1% sodium deoxycholate. The mixture containing proteins A and B is 
llicn cenirifuged at 5000 x g for 5 minutes. The supernatant and pellet formed b> 
centrifugaiion are then assayed for the presence of protein A and B. If protein A is found in 
the supernatant and not in the pellet (except for minor amounts (/.t\. less than 10%) as a 
result of trapping], protein is said to be soluble in the solution tested, if the majority of 
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proiein B is found in the pellet (i.e.. greater than 90%). then protein B is said to exist as a 
suspension in the solution tested. 

As used herein, the term "therapeutic amount" refers to that amount of antitoxin 
rec|uired to neutralize the pathologic effects of one or more clostridial toxins in a subject. 
5 rhe term "pyrogen" as used herein refers to a fever-producing substance. F^vrogens 

may be endogenous to the host ie.^., prostaglandins) or may be exogenous compounds {c^,. 
bacterial cndo- and exotoxins, nonbacterial compounds such as antigens and certain steroid 
compounds, etc.). The presence of pyrogen in a pharmaceutical solution may be detected 
using the i:'..S. Pharmacopeia (IJSP) rabbit fever lest (United States Pharmacopeia. Vol. XXII 

10 (1990) United Stales Pharmacopeial Convention. Rockvillc. MD, p. 151), 

The term "endotoxin" us used herein refers to the high molecular weighi complexes 
associated wiih the ouier membrane of gram-negaiive bacteria. Dnpurified cndoioxin contains 
lipids, proicins and carbohydrates. Highly purified endotoxin does not contain protein and is 
referred lo as lipopolysaccharidc (LPS). Because unpuriilcd cndoioxin is of concern in ihc 

1.^ productiiMi of pharmacculical compounds {e.^.. proteins produced in £. coli using recombinant 
DNA tcchnolog\ ). the term endotoxin as used herein refers to unpurified endotoxin. Bacterial 
cndt)toxin is a well known pyrogen. 

As used herein, the term "endotoxin-frce" when used in reference to a composition to 
he administered parcnterally (with the exception of intrathecal administration) lo a host means 

20 thai the liosc to be delivered contains less than 5 FUi/kg body weighi (FDA Ciuidelines for 
Parenteral Drugs (December l^)87)|. .Assuming a weight of 70 kg Ibr an adult human, the 
(lose must contain less than 350 liV lo meet FDA Guidelines for parenteral administration, 
l-ndoioxin levels are measured herein using the l.imulus Amcbocyte Lysaie (LAI.) test 
(Limulus Ameboeyie Lysate P\ rochrome'^*. Associates of Cape C od. Inc. Woods Mole. MA). 

25 l o measure endotoxin levels in preparations of recombinant proteins, 0.5 ml of a solution 

comprising 0,5 mg of purified recombinant protein in 50 mM NaPOj. pH 7.0. 0.3M NaCI and 
10% glycerol is used in the LAL assay according to the manufacturer's instructions for the 
endpoint chromogenic without diazo-coupling method (the specillc components of ihe buffer 
containing recombinant proiein to be analyzed in the LAL lest are not imporiani: any buffer 

.^0 having a neutral pH may be employed (see for example, aliernaiive buffers employed in 

Lxamples .'i4, 40 and 45) |. Compositions containing less than or equal lo lhan 250 endotoxin 
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units (EUVmg of purified recombinant protein are herein defined as "substantially endotoxin- 
free." Preferably the composition contains less than or equal to 100. and most preferably less 
than or equal to AO. (KUVmg of purified recombinant protein. Typically, administration of 
bacterial toxins or toxoids to adult humans for the purpose of vaccination involves doses of 
about 10-500 ny protein/dose, fhercfore. administration of 10-500 of a purified 
rccombinam protein to a 70 kg human, wherein said purified recombinant protein preparation 
contains AO F,U/mg protein, results in the introduction of only 0.6 to .10 lUJ (i.e.. 0.2 lo 8.6% 
of the maximum allowable endotoxin burden per parenteral dose). Administration of 10-500 
Mg of a purified recombinant protein to a 70 kg human, wherein .said purified recombinant 
protein preparation contains 250 EU/mg protein, results in the inu-oduciion of only 2.5 to 125 
l:U li.c. 0.7 lo .16% of the maximum allowable endotoxin burden per parentenil dose). 

The l...\L test is accepted by the U.S. FDA us a means of detecting bacterial 
endot«)xins (21 C.V.R. ,sjj fiw).100 -105). Studies have .shown that the LAI. test is equivalem 
or superior to the U,SI» rabbit pyrogen test for the detection of endotoxin and thus the l.AI. 
test can be used as a surrogate for pvrogenicity studies in animals |I-.C. I'erason. /Vn^.i^t-m. 
ciuhioxinx. l.AI. icstinK (.iinl ilvpyroficnutinn. Marcel Dekkcr. Now York (10X5). pp.1 50-155). 
llie I- DA lJurcau of IJiologics accepts the LAL as.s:iy in place of the USP rabbit pyrogen test 
.so long as the LAL assay utilized is shown to be as .sensitive a.s. or more sen.sitive as the 
rabbit icst (l ed. Reg.. 38. 261.10 ( 1980)1. 

I he lerni "monovalent" when used in reference to a clostridial \accinc refers to a 
v accine w hich is capable of provoking an immune response in a host animal directed again.st a 
single IV pe of clostridial toxin. For example, if immuni/aiion of a host with C. hiniilimim 
type A loxin vaccine induces antibodies in the immunized ho.st which protect again.si a 
challenge vviih type A loxin but noi against challenge with type H. C. D. H. I- or (i toxin.s. 
then the t\ pe A vaccine is said lo be monovalent. In contrast, a "mullivalenl" vaccine 
provokes an immune response in a host animal directed against .several (/.c. more than one) 
clostridial toxins. For example, if immunization of a host with a vaccine comprising C. 
houiliniim type A and U toxins induces the production of antibodies which proleei the host 
agaiiLst a challenge with both type A and B toxin, the vaccine is said lo be multivalent (in 
particular, this hypothetical vaccine is bivalent). 



- 18 - 



WO98Am540 



PCT/US97/15394 



As used herein ihc term "immunogenically-effeciiN e amount" refers lo that amouni of 
an immunogen required to invoke the production of protective levels of antibodies in a host 
upon vaccination. 

The term "protective level", when used in reference to the level of antibodies induced 
5 upon immunization of the host with an immunogen which comprises a bacterial toxin, means 
a level of circulating antibodies sufficient to protect the host from challenge with a lethal dose 
ol' the toxin. 

As used herein the terms "protein" and "polypeptide" refer to compounds comprising 
amino acids joined \ ia peptide bonds and are used interchangeably. 

The terms "toxin" and "neurotoxin" when used in reference to toxins produced by 
members (i.e., species and strains) of the genus Chsihdium are used interchangeably and 
refer lo the proteins which arc poisonous to nerve tissue. 

'I'he term "receptor-binding domain" when used in reference to a ( '. htuulinnni toxin 
refers to the carboxy-ierminal portion of the heavy chain (H^ or the C iVagnienl) of the toxin 

15 which is presumed to be responsible lor the binding of the active loxin iic the derivative 

toxin comprising the H iind L chains joined via disulfide bonds) lo receptors on the surface of 
synapiosomcs. The receptor-binding domain for ('. homlimim type A loxin is defined herein 
as comprising amino acid residues X61 through 1296 of SF-Q II) N():28. The receptor- 
binding domain for (* hotulinum type B loxin is defined herein as comprising amino acid 

20 residues 84X through 12^)1 of SHQ H) NO:40 (strain Ekiund I7B). The recepior-hinding 
dtunain of ( '. hotulhwm type CI loxin is defined herein as comprising amino acid residues 
X56 through 12^1 of SliQ ID N():6(). The receptor-binding domain of (' hotulmwv type D 
loxin is dcllned herein as comprising amino acid residues 852 through 1276 ofSl-Q II) 
N():66. fhe receptor-binding domain ol'C. hnndhnim type H loxin is defined herein as 

25 comprising amino acid residues 835 through 1250 of SKQ ID NO:50 (Beluga strain). The 
receptor-binding domain of C hotulinum type F toxin is defined herein as comprising amino 
acid residues 853 through 1274 of SEQ ID N0:71. The receptor-binding domain of (' 
Ixfiulinum type (j loxin is defined herein as comprising amino acid residues 853 through 1297 
of SF.O ID NO:77. Within a given serotype, small variations in the primary amino acid 

30 sequence ol the botulinal toxins isolated from different strains has been reported [Whelan ct 
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al. (1992). xupni and Minion (1995) Curr, Top. Microbiol. Immunol. 195:161-194]. The 
prcscnl invcnlion conlemplatcs fusion proteins comprising the rcccplor-bindini: domain of C. 
hoiulhwm toxins from serotypes A-Cj including the variants found among different strains 
within a given .serotype. The receptor-binding domains listed above arc used as llie prototype 
for each strain within a serotype. Fusion proteins containing an analogous region from a 
.strain other than the prototype strain are encompassed by ihc present invention. 

l-usion proteins comprising the receptor binding domain {i.e.. C* fragment) of boiulinal 
toxins may include amino acid residues located beyond the termini of ihe domains dcfmed 
above. Tor example, the pHisBolB protein contains amino acid residues 846-1291 of .SMQ ID 
N():4(): this fusion protein thus comprises the receptor-binding domain for C. hotulinum type 
B toxin as defmed above i/ t'.. Ile-848 llirough C;iu-129| ). .Similarly. pHisBoilI contains 
amino acid residues 827-1252 of SIZQ ID N0:5() and pIlisBolCi contains amino acid residues 
851-1297 of SFO ID N():77. Tims, both pHisBoiH and pHisMoKi fusion proteins contain a 
lew amino acids located beyond the N-terminus of the defmed receptor-binding domain. 

The terms "native gene" or "native gene sequences" are used to indicate DNA 
scLiucnccs encoding a particular gene which contain the same DNA sequences as found in the 
gene as isolated from nature. In contrast, "syntheiic gene sequences" arc DNA sequences 
which are used to replace the naturally occurring DNA sequences when the naturally 
iKXurring sequences cause expression problems in a given host cell, for example, naturally- 
occurring DNA sequences encoding codons which arc rarely used in a ho.st cell may be 
replaced it' .i^.. by site-directed mutagenesis) such that the synthetic DNA sequence represents 
a more frequcnily used codon. The native DNA sequence and the synthetic DNA sequence 
will preferably encode the same amino acid sequence. 

SUMMARY OF THE INVENTION 

The present invention relates to the production of polypeptides derived from toxins 
particularly in recombinant ho.st cells. In one embodiment, the present invention provides a 
host cell containing a recombinant expression vector, said vector encoding a protein 
comprising at least a portion of a ClostnJhim hotulinum toxin, said toxin selected from the 
group consisting of type B toxin and type E toxin. 1he present in\'ention is not limited by 
the nature of sequences encoding ptirtions of the ( ' homlinum toxin, l he.se sequences may be 
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derived from the native gene sequences or alternatively they may comprise synthetic gene 
sequences. Synthetic gene sequences are employed when expression of the native gene 
sequences is problematic in a given host cell (c^.i;., when the native gene sequences contain 
sequences resembling yeast transcription termination signals and the desired host cell is a 
yeast cell). 

In one embodiment, the host cell is capable of expressing the recombinant C\ 
hoiulhwm toxin protein at a level greater than or equal to 2% to 40% of the total cellular 
protein and preferably at a level greater than or equal to 5% of the total cellular protein. In 
another embodiment, the host cell is capable of expressing the recombinant hotulimim 
toxin protein as a soluble protein at a level greater than or equal to 0.25% of the total cellular 
protein and preferably at a level greater than or equal to 0.25% to 10% of the total cellular 
protein. 

The present inx cntion is not limited by the nature of the host cell employed lor the 
production ol' recombinant ('. hondiniim toxin proteins. In a preferred cmiiodimeni. the host 
cell is an 11. coli cell. In another preferred embodiment, the host cell is an insect cell: 
particularly preferred in.scci liosi cells are SpoUopivra fru^ipenla (SW) cells. In another 
preferred embodiment, the host cell is a yeast ceil: particularly preferred \cast cells are Pichiu 
juisioris cells. 

In another embodiment, the invention provides a host cell containing a recombinant 
expression \cctor. said vector encoding a fusion protein compri.sing a non-toxin protein 
sequence :ind at least a portion of :i Closindium homlinum toxin, said toxin .selected from the 
uroup consisting of type B toxin and type L: toxin. The invention is not limited by the nature 
i>!' the portion of the Closnidiuw hofu/inum toxin selected, in a preferred embodimeni, the 
portion of the toxin comprises the receptor binding domain {i.e.. the C fragment of the toxin). 
The present in\ ention is not limited b\ the nature of the non-toxin protein sequence 
employed. In a preferred embodiment, the non-toxin protein sequence comprises a poly- 
histidine tract. A number of alternative fusion tags or fusion partners are known to the art 

MB1\ (iST. protein A. etc.) and may be employed for the production of fusion proteins 
comprising a portion of a botulinal toxin. 
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The prcsenl inveniion further provides a vaccine comprising a fusion proiein, said 
fusion proiein comprising a non-toxin protein sequence and ai least a portion of a Closn idium 
houtlimtm loxin. said toxin selected from the group consisting of type B toxin and lype H 
toxin. The vaccine may be a monovalent vaccine {ic. containing only a loxin B fusion 
5 proiein or a toxin E fusion protein), a bivalent vaccine (/.t\, containing both a toxin B fusion 
proiein and a loxin E fusion proiein) or a irivalcnl or higher \alcncy vaccine. In a preferred 
embodiment, the toxin B fusion proiein and/or loxin E fusion protein is combined wiih a 
fusion proiein comprising a non-toxin proiein sequence and at least a portion of CloMriJium 
hntulhwm lype A loxin. The present invention is not limited by the nature of the portion of 

10 the Ciostriilium hotulimnn loxin selected. In a preferred embodiment, the portion of ihe loxin 
comprises the receptor binding domain (/.c\. the C fragment of the toxin), riie present 
inveniion is not limited by ihe nature of the non-toxin proiein sequence employed. In a 
preferred embodiment, ihe non-toxin protein sequence compri.ses a poly-histidine tract. .\ 
number o( alternative fusion lags or fusion partners are known to the art (<.^.t.^. .MBP. (jST. 

15 proiein j\. etc.) and may be employed for the generation of fusion proteins comprising 

\aecines. When a fusion partner (/.c. the non-loxin protein .sequence) is employed for the 
production of a recombinant ( * hotulinul toxin proiein. the fusion partner ma> he removed 
from ihe recombinanl (' hotulinul loxin protein if desired (/.c, prior to adminisiraiion of ihe 
proiein to a subjecl) using a variety of methods known to the art digestion of fusion 

20 proteins containing TaetorXa or thrombin recognition sites with ihc appropriate enzyme). A 
number ol' ihe pLTHis vectors employed herein provide an N-ierminal his-tag followed by a 
l-aciorXa cleavage site (see Example 28a): the botulinal C fragment sequences follow the 
I'aciorXa site and thus. I'aeiorXa can be used to remove ihe his-tag from ihe boiulinal fusion 
proiein. In a preferred embodiment, the vaccine is substantially endotoxin-l'rec, 

25 The present invention is not limited by the method employed for ihc generation of 

vaccine comprising fusion proteins comprising a non-toxin proiein sequence and ai least a 
portion of a i losnidium hotulimmi toxin. The fusion proteins may be produced by 
recombinant DNA means using either native or syniheiic gene sequences expressed in a host 
cell. The present invention is not limited to the production of vaccines using recombinant 

30 host cells: cell free //; vhro iranscripiion/translaiion systems may be employed for ihe 
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expression of the nucleic acid constructs encoding the fusion proteins of the present invention. 
An example of such a cell-free system is the commercially available TnT™ Coupled 
Reticulocyte Lysate System (Promeiia Corporation. Madison. WI). Alternatively, the fusion 
proteins of the present invention may be generated by synthetic means lie,, peptide 
synthesis). 

I he present invention further provides a method o!" generating antibody directed 
against a Clo.snu/hmi hotulinum toxin comprising: a) providing in any order: i) an antigen 
comprising a fusion protein comprising a non-toxin protein sec|ucnce and at least a portion of 
a Ch.sinclntm hontlinum toxin, said toxin selected from the group consisting of type U toxin 
and type li toxin, and ii) a host; and b) immunizing the host with the antigen so as to 
generate an antibody. In a preferred embodiment, the antigen used to immunize the host also 
contains a liision protein comprising a non-toxin protein sequence and at leasi a portion of 
i 'losthilitim hoiulimim type A toxin. I he present invention is not limited b\ the nature of the 
portion of the ChstnUitmi homlinum toxin selected. In a preferred embodimcni. the portion 
ol the toxin comprises the receptor binding domain {i.e.. the C fragment of the toxin). 1 he 
present invention is not limited by the nature of the nun-toxin protein sequence employed. In 
a preferred embodiment, the non-toxin protein sequence comprises a poly-histidine tract. A 
number of alternative fusion tags or lusion partners arc known to the an (e .i;.. MBP. GST. 
protem A. etc.) and may he employed for the generation of lusion proteins comprising 
vaccines. When a fusion partner {i.e., the non-toxin protein sequence) is employed for the 
production of a recombinant C boiulinal toxin protein, the fusion partner may be removed 
Irom the recombinant ( \ hoiulimtl toxin protein if desired (/.f.. prior to administration of the 
protein to a subject) using a variety of methods known to the art digestion of fusion 
proteins containing PactorXa or thrombin recognition sites with the appropriate enzyme). 

The present invention is not limited by the nature of the host employed for the 
production of the antibodies of the invention. In a preferred embodiment, the host is a 
mammal, preferably a human, The antibodies ol* the present invention may be generated 
using non-mammalian hosts such as bird.s, preferably chickens. In a preferred embodiment 
the method of the present invention further comprised the step ci i>f collecting the anlibodici; 
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from ihe host. In yel another embodimeni, the method of the present invention further 
comprises the step d) of purifying the antibodies. 

The present invention further provides antibodies raised according to the above 
methods. 

The present invention further contemplates multivalent vaccines comprising at least 
two recombinant i \ hotulimun toxin proteins derived from the group consisting of C. 
hontliiuim serotypes A. B, C. D. E. F. and 0. The invention contemplates bi\alcnt. trivalent, 
quadravalcnt, peniavalcnt. hcptavalenl and .septivaieni vaccines comprising recombinant C *. 
hotiilinum toxin proteins. iVeferably the recombinant hoiulinum toxin protein comprises 
10 the receptor binding domain (i.e,. C fragment) of the toxin. 

DESCRIPTION OF THE INVENTION 

The present invention contemplates vaccinating humans and other animals with 
polypeptides derived from hofulinum neurotoxins which are substantially endotoxin-free. 
15 These boiulinal peptides are also useful for the production of antitoxin. Anti-botulinal toxin 
antitoxin is useful for the treatment of patients effected by or at risk of symptoms due to the 
action ol'C hofulinum toxins, fhe organisms, toxins and individual steps of the present 
invention are described separately below. 

20 I. Clostridium Species, Clcstridial Diseases And Assciciatcd 1'oxins 

A preferred embodiment of the method of the present inv ention is directed toward 
obtaining antibodies against Clostvnihim species, their toxins, enzymes or other metabolic by- 
products, cell wall components, or synthetic or recombinant versions of any of these 
compounds. It is contemplated that these antibodies will be produced by immunization of 
25 humans or other animals. It is not intended that the present invention be limited to any 

particular toxin or any species of organism. In one embodinieni. toxins from all Clostridntm 
species are contemplated as immunogens. Examples of these toxins include the neuraminidase 
toxin of ( Imtyrivum, C xorc/ellii toxins ITf and I/f. toxins A. U. C\ I). IL I . and (i of ( '. 
hoiulinum and the numerous pcrfrm^ens toxins. In one preferred embodiment, toxins A, 



-24. 



wo 98/08540 



PCT/US97/15394 



B. and H of C. hondimtm are contemplated as immunogens. Table 2 above lists various 
Clo.sindium species, their toxins and some antigens associated with disease. 

TABLE 2 

Clostridial Toxins 



Organ ism 


Toxins and Disease-Associated Antigens 


( hnfti/hwm 


A. B. C. C,. D. 0. F. G 


<\ hmru'tiift 


Neuraminidase 




A. B. Enierotoxin (not A nor B). Motility Alierinsz Fiictor. Low 
Molecular Weight Toxin. Others 




a, p. c. 1. y. 6. V. 0. k*. k, n. u 


C. stin/citt 

( * hi/crinctuttns 


HT IT. a, n. y 


( '. lun vt 


a. \y V. 6. c. C V, 0 


(' scpucum 


a. p. v. o 


i '. hisitilyiU tm 


«. [y y. »x t: plus «iddiiioiuil enzymes 


( c/intt\'nct 


ii. {\. y. 0 



It is not intended that antibodies produced against one toxin will only be used against 
ihal toxin, it is contemplated that antibodies directed against one toxin ( pcrfrinj^ens 
i\pe A enteroioxin) may be used as an elTcctive therapeutic against one or more loxin(s) 
piodiiced b\ other members of the genus Clostridium or other H)xin producing organisms 
(i'..t:.. Ihwillus vvrcus, Staphylnvoccm aureus. Slrcpiococcus muians. Acinetobucter 
calcooccticits. Pseudnmomis acrui^inosa, other Pscudomnnm species, cic). It is further 
contemplated that antibodies directed against the portion of the toxin which binds to 
mammalian membranes U'.i,'.. ( pcrjnn\icns enteroioxin A) can also be used against other 
organisms, it is conictnplaled that these membrane binding domains are produced 
synthetically and used as immunogens. 

II. Obtaining Antibodies In Non-Mammals 

A preferred embodiment of the method of the present invention for obtaining 
aniibodies inNofves immunization. However, it is also conicmplaied that antibodies could be 
obtained from non-mammals without immunization. In the ca.se where no immunization is 
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contemplated, the present invention may use non-mammals with preexisting antibodies to 
toxins as well as non-mammals thai have antibodies lo whole organisms by virtue of reactions 
with the administered antigen. An example of the latter involves immunization with synthetic 
peptides or recombinant proteins sharing epitopes with whole organism components. 

In a preferred embodiment, the method of the present invention contemplates 
immunizing non-mammals with bacterial toxin(s). It is not intended that the present invention 
be limited to any particular toxin. In one embodiment, toxin from all clostridial bacteria 
sources {.\vc Table 2) are contemplated as immunogens. Txamples of these toxins are C. 
hittyriamt neuraminidase toxin, toxins A. B. C. D. F., F, and Ci from ( hontlimim . 
C \ pcrfi'ini^cns loxins (x. \). and k and (\ sordcllii toxins IIT and 1.1*. In a preferred 
embodiment, hoiulinum loxins A. B. C. D. E. and F (or fragments ihereoO are 
coniemplated as immunogens. 

A particularly preferred embodimcni involves the use of bacterial toxin protein or 
I'raumenls of toxin proteins produced by molecular biological means {i.e.. recombinant toxin 
proteins). In a preferred embodiment, the imnninogcn comprises the recepu)r-binding domain 
(/.t',. the -50 kl) carboxy-tcrminal portion of the heavy chain: also referred \o as the C" 
tVagment) oI'C. hoiulinum ser{>lype A neurotoxin produced by recombinant DNA technology. 
In another preferred embodiment, the immunogcn comprises the receptor-binding domain of 

hofitlinitm serotype B neurotoxin produced by recombinant DNA technt)k>ii\ . In yet 
another preferred embodiment, the immunogcn comprises the receptor-binding domain region 
of C. hiUitlinum serotype E neurotoxin produced by recombinant DNA teehnolog\ . In yet 
another preferred embodiment, the immunogcn comprises the receptor-binding domain region 
ol'C. hoiulinum .serotype CI neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogcn comprises the receptor-binding domain region 
ofC'. hofulinum serotype C2 neurotoxin produced by recombinant [)NA technology. In yet 
another preferred embodiment, the immunogcn comprises the receptor-binding domain region 
of hoiulinum .serotype D neurotoxin produced by recombinant DNA technologv. In yet 
another preferred embodiment, the immunogcn comprises the receptor-binding domain region 
of luuulinum serotype F neurotoxin prt)duced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogcn comprises the receptor-binding domain region 
of hofulinum serotype CJ neurotoxin produced by recombinant DNA technology. In a 
preferred embodiment, the recombinant botulinal toxin proteins are expressed as fusion 
proteins (c.^'.. as histidinc-tagged proteins). In a still further preferred embodiment, the 
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immunoaen is a mullivaleni vaccine comprising the receptor-binding domain region oi C. 
homlimtm toxin from two or more toxins selected from the group consisting of type A. type 
B. type C (including CI and C2), type D. type E. and type F toxin. 

When immuniziition is used, the preferred non-mammal is from the class .4vc'.v. All 
birds are contemplated (f. duck, ostrich, emu. turkey, etc.). A preferred bird is a chicken, 
hiiportanily. chicken antibody does not fix mammalian complement. {Sea H.N. Benson ci aL. 
.1. Immunol. 87:616 (l%l).] Thus, chicken antibody will normally not cause a complement- 
dependent reaction. |A.A. Benedict and K. Yamaga. 'lmnnmo\rlohulins ami Aniihody 
Proihfamn in Avian Species." in Compara/ive Immunolo^* Marchaloni. cd.), pp. 335- 
375. Blackwcll. Oxford ( 1966).) Thus, ihe preferred antitoxins of the present invention will 
not exhibit complement-related side effects observed with antitoxins known presently. 

W hen birds are u.scd. it is contemplated that the antibody will be obtained from either 
the bird scrum or the egg. A preferred embodiment involves collection of the antibody IVom 
the cuu. Laying hens transport immunoglobulin lo the egg yolk ("IgY") in concentrations 
equal to or exceeding thai found in scrum. [See R. Patterson ct <//.. .1. Immunol. 89:272 
(1962): and S.B. Carroll and B.D. Siollar. J. Biol. Chcm. 258:24 (I983).| in addition, the 
large N'olumc of egg yolk produced \asily exceeds the \ olumc of scrum thai can be safely 
obtained from the bird over any given time period. Finally, the antibody Irom eggs is purer 
and more homogeneous: there is far less non-immunoglobulin protein (as compared to serum) 
and only one class of immunoglobulin is transported to the yolk. 

When considering immunization with toxins, one may consider modification of the 
toxins to reduce the toxicity. In this regard, it is not intended that the present invention be 
limited by immunization with modified toxin. Unmodified ("native") toxin is also 
contemplated as an immunogen. 

It is also not intended that the present invention be limited by the type of modification 
- if modification is used. The present invention contemplates all types of toxin modification, 
including chemical and heal treatment of the toxin. The preferred modification, however, is 
formaldch\de treatment. 

It is not intended that the present invention be limited to a particular mode of 
immunization: the present invention contemplates all modes of immunization, including 
subcutaneous, intramuscular, intraperitoneal, and intravenous or intravascular injection, as well 
as per as administration of immunogen. 
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The present invention further contemplates immunization with or without adjuvant. 
(Adjuvant is defined as a substance known to increase the immune response to other antigens 
when administered with other antigens.) If adjuvant is used, it is not intended that the present 
invention be limited to any particular type of adjuvant - or that the same adjuvant, once used. 
5 be used all the time. While the present invention contemplates all types of adjuvant, whether 
used separately or in combinations, the preferred use of adjuvant is the use of Complete 
Freund s Adjuvant followed sometime later with Incomplete I reund's Adjuvant. Another 
preferred use of adjuvant is the use of Gerbu Adjuvant. The invention also contemplates the 
use ol' RIBI fowl adjuvant and Qt"! A adjuvant. 

When immunization is used, the present invention contemplates a wide variety of 
immunization schedules. In one embodiment, a chicken is administered toxin(s) on day zero 
and subsequently receives toxin(s) in intervals thereafter. It is not intended that the present 
invention be limited by the particular intervals or do.ses. Similarly, it is not intended ihai ihe 
present inNcntion be limited to any particular schedule for collectinu antibody, fhc preferred 
15 collection time is sometime after day 100. 

Where birds are used and collection of antibody is perl'ormed by colleciinu eges. the 
ei^gs may be stored prior to processing for antibody. It is preferred that eggs be stored at 4°C 
for less than t)ne year. 

It is contemplated thai chicken antibody produced in this manner can he buffer- 
20 . cxiracicd and u.scd analytically. While unpurificd. this preparation can serve as a reference 
for activity of the antibody prior to further manipulations (c.i; . immunoaffmity purification). 

III. Increasing The Effectiveness Of Antibodies 

When purification is u.sed. the present invention contemplates purifying lo increase the 
25 effectiveness of both non-mammalian antitoxins and mammalian antitoxins. .Specifically, the 
present invention contemplates increasing the percent of toxin-reactive immunoglobulin. The 
preferred purification approach for avian antibody is polyethylene glycol (PI:(j) separation. 

fhe present invention contemplates that avian antibody be initially purified using 
simple, inexpensive procedures. In one embodiment, chicken antibody from eggs is purified 
30 by extraction and precipitation with POG. PEG purification exploits the differential solubility 
of lipids (which are abundant in egg yolks) and yolk proteins in high concentrations of Pl-Xi 
8000. [Poison CI uL. Immunol. Comm. 9:495 (1 980). J The technique is rapid, simple, and 
relatively inexpensive and yields an immunoglobulin fraction that is significantly purer in 
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terms of coiuaminaiing non-immunoglobulin proteins than the comparable ammonium sulfate 
tractions or mammalian sera and horse antibodies. The majority of the PEG is removed from 
the precipitated chicken immunoglobulin by treatment with elhanol. Indeed. PEG-purified 
antibody is sufficiently pure that the present invention contemplates the use of PI£G-purified 
antitoxins in the passive immunization of intoxicated humans and animals. 

IV. Treatment 

The present invention contemplates antitoxin therapy for humans and other animals 
intoxicated by bacterial toxins. A preferred method of treatment is by intravenous 
administration of anti-boutlinal antitoxin: oral administration is also contemplated for other 
clostridial antitoxins. 

A. Dosage Of Antitoxin 

It was noted by way of background that a balance must be struck when administering 
currently available antitoxin which is usually produced in large animals such as horses: 
suMlcient antitoxin must be administered to neutralize the toxin, but nol so nuich antitoxin as 
to increase the risk of untoward side cffccis. These side effects are caused by: i) patient 
sen.sitivity to foreign (c.i;. horse) proteins: ii) anaphylactic or immunogenic properties of non- 
immunoglobulin proteins: iii) the complement fixing properties of mammalian antibodies: 
and/or i\ ) the overall burden of foreign protein administered. It is extremely difficult to 
strike this balance when, as noted above, the deizree of intoxication (and hence the level of 
antitoxin therapy needed) can only be approximated. 

The present invention contemplates signillcantly reducing side effects so that this 
balance is more easily achieved. I reatment according to the present invention contemplates 
reducing side etfects by using PHG-puritied antitoxin from birds. 

In one embodiment, the treatment of the present invention contemplates the use of 
PH(i-purified antitoxin from birds. The use of yolk-derived. PHG-purified antibody as 
antitoxin allows for the administration of: I) non(mammalian)-complement-fixing. avian 
antibody: 2) a less heterogeneous mixture of non-immunoglobulin proteins: and 3) less total 
protein to deliver the equivalent weight of active antibody present in currently available 
antitoxins. The non-mammalian source ol' the antitoxin makes it useful for treating patients 
who are sensitive to horse or other mammalian sera. 
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B. Delivery Of Antitoxin 

Although it is not intended to limit the route of delivery, the present invention 
contemplates a method for antitoxin treatment of bacterial intoxication in wliich delivery of 
antitoxin is oral. In one embodimeni. antitoxin is delivered in a solid form (c'.,i;., tablets). In 
5 an alternative embodiment antitoxin is delivered in an aqueous solution. When an aqueous 
solution is used, the solution has sufficient ionic strength to solubilize antibody protein, yet is 
made palatable for oral administration. The delivery solution may also be buffered (c.^., 
carbonate buffer pH 9.5) which can neutralize stomach acids and stabilize the antibodies when 
the antibodies are administered orally. In one embodiment the delivery solution is an aqueous 

10 solution. In another embodiment the deliver)' solution is a nutritional Ibrmula. Preferably. 

the delivery .solution is infant formula. Yet another embodiment contemplates the delivery of 
lyopiiili/ed antibody encapsulated or microencapsulated inside acid-resisiant compounds. 

Methods of applying enteric coalings to pharmaceutical compounds are well known to 
ihe an |companies specializing in the coating of pharmaceutical compounds arc available: for 

15 example, fhe Coaling Place (Verona. Wl) and AAl (Wilmington. NC)]. Hnieric coatings 

which arc rcsisiani lo gastric lluid and whose release (/.<'.. dissolution of the coating to release 
the pharmaccuiical c(uiipound) is pH dependent are commercially available |for example, the 
polymcihacrylatcs LudragilH- I. and KudragilOO S (Rohm GmbH)]. liudragiiK' S is soluble in 
iniesiinal lluid IVom pll 7.0: this coating can be used to microencapsulate lyophilizcd anlitoxin 

20 antibodies and the particles are suspended in a .solution having a pll above or below pll 7.0 
for oral administration. The microparticles will remain intact and undi.ssolvcd until ihey 
reached the intestines where the intestinal pH would cause them to dissolve thereby releasing 
the aniiioxin. 

The invention contemplates a method of treaiment which can be administered for 
25 treaiment of acute intoxication. In one embodiment, anlitoxin is administered orally in either 

a delivery solution or in tablet form, in therapeutic dosage, to a subject intoxicated by the 

bacterial toxin which served as immunogen for the anlitoxin. 

The invention also contemplates a method of treatment which can be administered 

prophylactieally. In one embodiment, antitoxin is administered orally, in a delivery .solution, 
."^O in therapeutic do.sage, to a subject, to prevent intoxication of the subject by the bacterial toxin 

which serxed as immunogen for the production of antitoxin. In another embodiment. 

antitoxin is administered orally in solid form such as tablets or as microencapsulated particles. 

Microencapsulation of lyophilized antibody using compounds such as l:udragit® (Rohm 
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GmbH) or polyethylene glycol . which dissolve at a wide range of pH units, allows the oral 
administration of solid antitoxin in a liquid form (Le., a suspension) to recipients unable lo 
tolerate administration of tablets (t'.jtf.. children or patients on feeding tubes). In one preferred 
embodiment the subject is a child. In another embodiment, antibody raised against whole 
5 bacterial organism is administered orally to a subject, in a delivery .solution, in therapeutic 
dosage. 

V. Vaccinvs Against ClosrHdial Species 

The invention contemplates the generation of mono- and multivalent vaccines for the 
10 protection of an animal (particularly humans) against several clostridial species. Of particular 
interest are \'aecine.s which stimulate the production of a humoral imnninc response to ( 
hnntlinnm. ( \ leumi and i \ ciifjfidh in humans. The antigens comprising the \ accine 
preparation may be native or rccombinanily produced toxin proteins from the clostridial 
.species listed above. When toxin proteins are used as immunogens ihey are generally 
15 modified to reduce the toxicity. This modification may be by chemical or genetic (/.c. 

recombinant DNA technology) means, in general genetic detoxitlcaiion {i.e.. the expression 
of nontoxic fragments in a host cell) is preferred as the expression of nontoxic fragments in a 
host cell precludes the presence of intact, active toxin in the final preparation. However, 
when chemical modification is desired, the preferred toxin modification is formaldehyde 
20 treatment. 

The invention contemplates that recombinant C*. hoiulimim toxin proteins be used as 
antigens in mono- and multivalent vaccine preparations. Soluble, subsianiially endotoxin-frec 
recombinant C\ homlimmi toxin proteins derived from serotypes A. B and C may be used 
individually (i.e.. as mono-valent vaccines) or in combination [i.e. as a mulli-valent vaccine). 

2:) In addition, the recombinant howlinum toxin proteins derived from scrotpes A. B and I- 
may be used in conjunction with either recombinant or native toxins or toxoids from other 
serotypes ol C hondiimnu C diffidlc and C. /etani as antigens for the preparation of these 
mono- and multivalent vaccines. It is contemplated that, due to the structural similarity of (\ 
botulinwn and C. tcumi toxin proteins, a vaccine comprising L\ dijfwila and hotulinum toxin 

M) proteins (native or recombinant or a mixture thereof) be used to .stimulate an immune 
response against i\ Iwlulimmi C. lelcmi and C dij/icilc. 
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The preseni invention further coniemplales muili-valenl vaccines comprising two or 
more boiulinal loxin proteins selected from the group comprising recombinant ( hotulinum 
toxin proteins derived from serotypes A. B. C (including CI and C2). D. E. F and G. 
The adverse consequences of exposure lo boiulinal toxin would be avoided by 
5 immunization of subjects at risk of exposure to the toxin with nontoxic preparations which 
confer immunity such as chemically or genetically detoxified toxin. 

Vaccines which confer immunity against one or more of the toxin types A. B. 1^. 1-' 
and G would be useful as a means of protecting humans from the deleterious effects ol* those 
r. holulmum toxins known to affect man. Indeed as the possibility exists thai humans could 

10 be exposed lo any of the seven serotypes of C boiidimim loxin (t'.;^'., during biological 

warfare or the production of toxin in a laboratory selling)* multivalent vaccines capable of 
conferring immunity against loxin types A-G (including both CI and C2 toxins) would be 
useful for the protection of humans. Vaccines which confer immunity against one or more of 
the loxin types C. D and E would be useful for veterinary applications. 

15 fhe I'Joiulinal neurotoxin is synthesized as a single polypeptide chain which is 

processed into a heavy (H: ^100 kD) and a light (I.: -oO kD) chain by cleavage with proteolytic 
enzymes: these two chains are held together via disulfide bonds in the active loxin (referred to 
as derivative toxin) [B.R. DasGupla and 11. Sugiyama. Biochcm. Biophys. Res. Commun. 
48:108 (1972): reviewed in B.R. DasGupla, J. Physiol. 84:220 (1990). H. Sugiyama. 

20 Microbiol. Rev. 44:419 (1980) and C.L. Halheway. Clin. Microbiol. Rev. 3:66 (1990)]. The 
heavy chain of the active toxin is cleaved by trypsin lo produce two fragments termed \ \^ 
(also referred to as H, or C) and l l^j (also referred to as or B). I he H( fragment (-46 kl)) 
comprises the carboxy end of the II chain. The 11^. fragment (-49 kl)) comprises the animo 
end and remains attached to the L chain (UnL). Neither or \\^\. is toxic. \\^ competes 

25 with whole derivative toxin for binding lo synaplosomes and therefore Il( is said to contain 
the receptor binding site. The \ \ and H^^ fragments of boiulinal toxin are analogous to the 
fragments C and B of tetanus toxin which are produced by papain cleavage, f he C fragment 
of tetanus toxin has been shown to be responsible for the binding of tetanus toxin to purified 
gangliosidcs and neuronal cells (Malpern and Loflus. J. Biol. Chem. 288:1 1 188 (1993)]. 

30 Antisera raised against purified preparations of isolated boiulinal M and 1. chains have 

been shown lo protect mice against the lethal effects of the toxin: however, the effectiveness 
of the two antisera differ with the anti-U sera being more potent (H. Sugiyama. supra). 
While ihe different boiulinal toxins show structural similarity to one another, the dilTerent 
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serotypes are reported to be immunologically distinct (ie,. sera raised against one toxin type 
does noi cross-react to a significant degree with other types). Thus, the generation of 
multivalent vaccines may require the use of more than one type of toxin. 

( ' hoiulmum toxin genes from all seven serotypes have been cloned and sequenced 
5 (Minion (1995). supra): in addition, partial amino acid sequence is available for a number of 
( hniiii'miim toxins isolated from different strains within a given serotype. The hotulimtm 
toxins contain about 1250-1300 amino acid residues. On the DNA level, the overall degree of 
homology between C. houdinum serotypes A. B. C. D and K toxins averages between 50 and 
60% identity with a greater degree of homology beini^ found between H chain-encoding 

10 regions than between those encoding L chains [Whelan et ai ( 1992) Appl. Environ. 

Microbiol. 58:2345). The degree of identity between C. hotulintnn toxins on the amino acid 
level reflects the level ol* DNA sequence homology. The most divergent area of DNA and 
amino acid sequence is found within the carboxy-ierminal area of the various C. hotulinum I I 
chain i!enes. This portion of the toxin (i.e., l-f(.or the C iVagment) plays a major role in cell 

15 binding. As loxin from different serotypes is thought to bind to distinct cell receptor 
molecules, it is not surprising that the toxins diverge significantly over this region. 

Within a given serotype, small variations in the primary amino acid sequence of the 
bniulinal loxins isolated from different si rains has been reported | Whelan cv ciL (1992). supvci 
and Minion (1995), supra]. The present invention contemplates fusion proteins comprising 

20 portions of C hoiulinum loxins from serotypes A-G including the variants found among 
different .strains within a given serotype. The present invention provides oligonucleotide 
primers which may be used to amplify ihe C fragment or receptor-binding region of the toxin 
gene iVom various strains off. hondimmi serotype A. serotype B. serotype C (CI and C2). 
serotype D. serotype E. serotype F and serotype G. A large number of different strains of (' 

25 hoiulhumi serotype A. serotype B. serotype C. serotype D serotype E and serotype F are 

available from the American Type Culture Collection (ATCC: Rockville. MD). For example, 
the ATCC provides the following: Type A strains: 174 (ATCC 3502), 457 (ATCC 17862). 
and NCTC 7272 (ATCC 19397): Type B strains: 34 (ATCC 439). 62A (ATCC 7948). NCA 
213 B (ATCC 7949). 13114 (ATCC 8083). 3137 (ATCC 17780). 1347 (ATCC 17841). 2017 

30 (ATCC 17843), 2217 (ATCC 17844). 2254 (ATCC 17845) and VP 1731 (ATCC 25765); 

Type C strains: 2220 (ATCC 17782). 2239 (ATCC 17783), 2223 (ATCC 17784: a type C-fi 
.strain: C-p strains produce C2 toxin). 662 (ATCC 17849; a type C-a strain; C-a strains 
produce mainly CI toxin and a .small amount of C2 toxin). 2021 (ATCC 17850: a type C-a 
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Strain) and VPl 3803 (ATCC 25766); Type D strains: ATCC 9633, 2023 (ATCC 17851), and 
VPI 5995 (ATCC 27517); Type li strains: ATCC 43181. 36208 (ATCC 9564), 2231 (ATCC 
17786). 2229 (ATCC 17852). 2279 (ATCC 17854) and 2285 (ATCC 17855) and Type F 
strains: 2()2F (ATCC 23387). VPI 4404 (ATCC 25764), VPI 2382 (ATCC 27321) and 
5 I.angeland (ATCC 35415). Type Ci strain. 1 13/30 (NCFB 3012) may be obtained from the 
National Collection of Food Bacteria (NCFB. AFRC Institute of Food Research. Reading, 
United Kingdom). 

Purification methods have been reported for native toxin types A. B, C\ D. E, and I- 
[reviewed in G. Sakaguchi. Pharmac. Ther. 19:165 (I983)J. As ihe different botulinal toxins 

10 are structurally related, the invention contemplates the expression of any of the botulinal 
toxins (c.i:^ lypes A-G) as soluble recombinant fusion proteins. 

In particular, methods for purification of the type A botulinum neurotoxin have been 
developed |L.J. Moberg and H. Sugiyama. Appl. linviron. Microbiol. 35:878 (1978)]. 
Immunization of hens with detoxified purified protein results in the generation of neutralizing 

1."^ antibodies |B.S. Thalley ci al.. in DotuUnum and Tetcmm Nvuroioxim. B.R. DasCiupla. ed.. 
Plenum Press. New York (1993). p. 467]. 

The currently available ( ' hntulinum pentavalent vaccine comprising chemically 
inactivated {i.e. formaldehyde treated) type A. B. C, D and I: toxins is not adequate. The 
efficacy is variable (in particular, only 78% of recipients produce protective levels of anti-type 

20 B antibodies following administration of the primary scries) and immunization is painful 

(deep subcutaneous inoculation is required for administration), with adverse reactions being 
common (moderate to severe local reactions occur in approximately 6% of recipients upon 
initial injection: this number rises to approximately 1 1% ol* individuals who receive booster 
injections) | Informational Brochure for the Pentavalent (ABCI)I.*) Botulinum loxoid. Centers 

25 for Disea.se Control). Preparation of this vaccine is dangerous as active toxin must be handled 
by laboratory workers. 

In general, chemical detoxification of bacterial toxins using agents such as 
formaldeh\'de. glutaraldehyde or hydrogen peroxide is not optimal for the generation of 
vaccines or antitoxins. A delicate balance must be struck between too much and too little 

30 chemical modification. If the treatment is insufficient, the vaccine may retain residual 

toxicity. If the treatment is too exce.ssivc. the vaccine may lose pt)tcncy due to destruction of 
natiNe immunogenic determinants. Another major limitation of using botulinal toxoids for the 
generation of antitoxins or vaccines is the high production expense. Vox the above reasons. 
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the development of methods for the production of nontoxic but immunogenic C. hotulinum 
loxin proteins is desirable. 

The C. hotulinum and C. tetanus toxin proteins have similar structures [reviewed in 
F.J. Schantz and E.A. Johnson. Microbiol. Rev. 56:80 (I W2)l. The carboxy-terminal 50 kD 
fragment of the tetanus toxin heavy chain (fragment C) is released by papain cleavage and has 
been shown to be non-toxic and immunogenic. Recombinant tetanus toxin fragment C has 
been developed as a candidate vaccine antigen [AJ. Makoff c/ al., Bio/Technology 7:1043 
{\^m)\. Mice immunized with recombinant tetanus toxin fragment C were protected from 
challenge with lethal doses of tetanus toxin. No studies have demonstrated that the 
recombinant tetanus fragment C protein confers immunity against oilier boiulinal toxins such 
as the C. hotulinum toxins. 

Recombinant tetanus fragment C has been expressed in £. coli (A.. I. Makoff c//.. 
Bio/Tcchnology. supra and Nucleic Acids Res. 17:10191 (1989): J,L. llalpern et ciL. Infect. 
Immun. 5X:IO04 (I990)J. ycasl [M.A. Komanos et uL. Nucleic Acids Res. 19:1461 (I99l)| 
and baculovirus |I.G. Charles c/ uL. Infect. Immun. 59:1627 (1991)]. Synthetic tetanus toxin 
genes had to he constructed to facilitate expression in yeast (M.A. Romanos ct uL. supra) and 
/:. coli \AJ. Makoffc'/ <//., Nucleic Acids Res., supra], due to the high A-T content of the 
tetanus io.\in uenc sequences. High A-T content is a common feature of clostridial genes 
(M.R, PopolTc'/ at.. Infect. Immun. 59:3673 (1991); H.F. LaPenoliere ct aL. in Botulinum and 
Tetanus Scnrnioxins. B.R. DasGupta, ed.. Plenum Press. New York (1993). p. 463 1 which 
creates expression dil'tlcullies in £. coli and yea.st due primarily to altered codon usage 
frequency and fortuitous polyadenylatiun sites, respectively. 

Fhe C* fragment of the C. hotulinum type A neurotoxin heavy chain has been evaluated 
as a \accine candidate. The i \ hotutmum type A neurotoxin gene has been cloned and 
sequenced (D.L. Thompson ct oL, Eur. J. Biochem. 189:73 (1990)1. The C fragment of the 
type A loxin was expressed as either a fusion protein comprising the botulinal C fragment 
fused with the maltose binding protein (MBP) or as a native protein |H.F. LaPenotiere ct aL. 
(1993) supra, II. F, LaPenotiere ct aL. Toxicon. 33:1383 (1995) and Middlebrook and Brown 
(1995). Curr. lop. Microbiol. Immunol. 195:89-122]. The plasmid construct encoding the 
native protein was reported to be unstable, while the fusion protein was expressed primarily in 
inclusion bodies as insoluble protein. Immunization of mice with crudely puriHed MBP 
fusion protein resulted in protection against IP challenge with 3 LD,o doses of toxin 
(LaPenotiere ct al„ (1993) and (1995), supral However, this recombinant ( '. hotulinum type 
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A loxin C fragmenl/MBP fusion protein is not a suitable immunogen for the production of 
vaccines as ii is expressed as an insoluble protein in £ afli. Furthermore, this recombinant 
C hotulinum type A toxin C fragmenl/MBP fusion protein was nol shown to be substantially 
Free of endotoxin contamination. Experience with recombinant (\ hotulinum type A toxin C 
5 fragment/MBP fusion proteins shows that the presence of the MBP on the fusion protein 
greatly complicates the removal of endotoxin from preparations of the recombinant fusion 
protein {.\vc Ex. 24. infra). Expression of a synthetic gene encoding hondimm type A 
toxin C fragment as a soluble protein excreted from insect cells has been reported 
[Middlcbrook and Brown ( 1995), supra]: no details regarding the level of expression achieved 

10 or the presence of endotoxin or other pyrogens were provided. Like the insoluble protein 

expressed in t, coti, immunization with the recombinant protein produced in insect cells was 
reported to protect mice from challenge with C. hotulinum toxin A. 

Inclusion body protein must be solubilized prior to purillcation and/or administration 
to a host. The harsh treatment of inclusion body protein needed to accomplish this 

15 solubilization may reduce the immunogenicily of the purified protein. Ideally, recombinant 
proteins to he used as vaccines are expressed as soluble proteins at high levels (i.e.. greater 
than or equal to about 0.75% of total cellular protein) in E. colt or other host cells (c'.;^'.. 
yeast, insect cells, etc.)- fhis facilitates the production and isolation of sufficient quantities of 
the immunogen in a highly purified form (/,t\. substantially free of endotoxin or other 

20 pyrogen contamination). The ability to express recombinant loxin proteins as soluble proteins 
in £. vnli is advantageous due to the low cost of growth compared to insect or mammalian 
tissue culture cells. 

The ( ' hotulinum type B neurotoxin gene has been cloned and sequenced from two 
strains of hotulinum type B (Whelan at al. (1992) Appl. Environ. Microbiol, 58:2345 

25 (Danish strain) and Hutson at ai (1994) Curr. Microbiol. 28:101 (Ekiund 17B strain)]. The 
nucleotide sequence of the toxin gene derived from the Ekiund I7B strain (ATCC 25765) is 
available from the EMBL/CienBank sequence data banks under the accession number X71343: 
the nucleotide sequence of the coding region is listed in SEQ ID N():39. The amino acid 
sequence of the C*. hotulinum type B neurotoxin derived from the strain Ekiund 17B is listed 

30 in SEO ID NO:40. The nucleotide sequence of the C. hotulinum serotype B toxin gene 

derived from the Danish strain is listed in SEQ ID N0:41. The amino acid sequence of the 
(' hotulinum type B neurotoxin derived from the Danish strain is listed in SEQ ID NO:42. 
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The C. boitdimm type B neurotoxin gene is synthesized as a single polypeptide chain 
which is processed to form a dimer composed of a light and a heavy chain linked via 
disulfide bonds. The light chain is responsible for pharmacological activity (i.e.. inhibition of 
the release of acetylcholine at the neuromuscular junction). The N-ierminal portion of the 
heavy chain is thought to mediate channel formation while the C-terminal portion mediates 
toxin binding: the type B neurotoxin has been reported to exist as a mixture of predominantly 
single chain with some double chain ( Whelan ei ciL. supra). The 50 kD carboxy-ierminal 
portion t)f the heavy chain is referred to as the C Iragmcnt or the I I,. domain. The present 
invention reports for the first time, the expression of the C fragment of C. hotulinum type B 
toxin in heterologous hosts (c./:.. £. coli). 

The C. hamlinum l\'pc F- neurotoxin gene has been cloned and sequenced from a 
number of different strains [Poulet a uL (1992) Biochem. Biopliys. Res. Commun. 183;]07: 
Whelan ei al. (1992) liur. J. Biochem. 204:657: and Fujii a al. {\993) J. Gen. Microbiol. 
i39:70|. The nucleotide sequence of the type E toxin gene is available from the liMBL 
sequence data bank under accession numbers X62089 (strain Beluga) and X62683 (strain 
NCTC 11219): the nucleotide sequence of the coding region (strain Beluga) is listed in SHQ 
ID NO:45. The amino acid sequence of the C hoiulimim type E neurotoxin derived from 
strain Bcluua is listed in SEQ ID NC):46. The type E neurotoxin gene is synthesized as a 
single polypeptide chain which may be converted to a double-chain form {i.e.. a heavy chain 
and a light chinn) by cleavage with trypsin: unlike the type A neurotoxin, the type F 
neurotoxin exists essentially only in the single-chain form. The 50 kD carboxy-terminal 
portion of the heavy chain is referred to as the C fragment or the H,- domain. The present 
invention reports for the first time, the expression of the C fragment off.', honttimim type F 
toxin in heterologous hosts (c.ir.. E, coll). 

The hondinum type CI. D. F and G neurotoxin genes have been cloned and 
.sequenced. The nucleotide and amino acid .sequences of these genes and toxins are provided 
herein. The invention provides methods for the expression of the C fragment from each of 
these toxin genes in heterologous hosts and the purification of the resulting recombinant 
proteins. 

The subject invention provides methods which allow the production of soluble 
hoiulimim toxin proteins in economical host cells (e.^i.. E. coii). In addition the subject 
invention provides methods which allow the production of soluble bolulinal toxin proteins in 
yeast and insect cells. I-urther. methods for the isolation of purified soluble C \ hotulinunj 
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toxin proteins which are suitable for immunization of humans and other animals are provided. 
These soluble, purified preparations of C botulinum toxin proteins provide the basis for 
improved vaccine preparations and facilitate the production of antitoxin. 

When recombinant clostridial toxin proteins produced in gram-negative bacteria (e.^;.. 
5 £ coli) arc used as vaccines, they are purified to remove endotoxin prior to administration to 
a host animal. In order to vaccinate a host, an immunogcnically-effective amount of purified 
substantially cndotoxin-free recombinant clostridial toxin protein is administered in any of a 
number of physiologically acceptable carriers known to the art. When administered for the 
purpose of vaccination, the purified substantially endotoxin- free recombinant clostridial toxin 

10 protein may he used alone or in conjunction with known adjutants, including potassium alum, 
aluminum phosphate, aluminum hydroxide, Gerbu adjuvant (GmDP: C.C. Biotech Corp.). 
RIBI udju\ani (MPL; RIBI immunochemical Research. Inc.). QS21 (Cambridge Biotech). 
The alum and aluminum-based adjutants arc particularly preferred when vaccines arc to be 
administered to humans: however, any adjuvant approved for use in humans may be 

15 employed. The route of immunization may be nasal, oral, intramuscular, intraperitoneal or 
subcutaneous. 

The invention contemplates the use of soluble, substantially endoioxin-free 
preparations of fusion proteins comprising the C fragment of the ( *. hntulimtm type A, B, C. 
D. E. F. and Ci toxin as vaccines. In one embodiment, the vaccine compri.ses the C fragment 

20 of either the ( ' botulinum type A, B, C. D. E, F. or (i toxin and a poly-hislidine tract (also 
called a histidine tag). In a particularly preferred embodiment, a fusion protein comprising 
the histidine tagged C fragment is expressed using the pHT series of expression vectors 
(Novagen). The pET expression system utilizes a vector containing the T? promoter which 
encodes the fusion protein and a host cell which can be induced to express the T7 DNA 

25 polymerase (/.t/., a DE3 host .strain). The production of C fragment fusion proteins containing 
a histidine tract is not limited to the u.se of a particular expression vector and host strain. 
Several commercially available expression vectors and host strains can be used to express the 
C fragment protein sequences as a fusion protein containing a histidine tract (For example, the 
pOE series (pQE-S. 12. 16. 17. 18. 30, 31, 32. 40. 41, 42. 50. 51. 52. 60 and 70) of 

30 expression vectors (Qiagen) which are u.scd with the ho.si strains M15fpRKP4| (Qiagen) and 
S()l3009|pREP4| (Qiagen) can be u.sed to express I'usion proteins containing six histidine 
residues at the amino-lerminus of the fusion protein). Furthermore a number of commercially 
available expression vectors which provide a histidine tract also provide a protease cleavage 
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site between the histidine tract and the protein of interest icf:., botulinal toxin sequences). 
Cleavage of the resulting fusion protein with the appropriate protease will remove the 
histidine tag from the protein of interest (t'./?.. botulinal to,\in sequences) (see Example 28a. 
//?//•«). Removal of the histidine tag may be desirable prior to administration of the 
recombinant botulinal toxin protein to a subject {c.k- a human). 



in, 
I 



VI. Detection Of Toxin 

The invention contemplates detecting bacterial toxin in a sample. The term "sample" 
in the present specification and claims is used in its broadest .sense. On the one hand it i.s 
meant to include a specimen or culture. On the other hand, it \s meant to include both 
biological and environmental samples. 

Biological samples may be animal, including human. Iluid. solid (c jtr-. siool) or tissue: 
liquid and .solid food products and ingredients such as dairy iicni.s. vegetables, meat and meal 
by-products, and waste. Environmental samples include environmental material such as 
surface mailer, soil, water and indu.slrial samples, as well as samples obtained from lo(»d and 
dair\ processing instruments, apparatus, equipment, disposable and non-di.sposable items. 
I hcsc examples are not to be construed as limiting the .sample types applicable to the present 



invention. 



The invention contemplates delecting bacterial loxin by a competitive immunoassay 
method thai utilizes recombinant lo.xin A and loxin B proteins, antibodies raised against 
recombinant bacterial lo.xin proteins. A fixed amount of the recombinant toxin proteins are 
iml1lobili^ed to ;i solid support (c.f;.. a microliter plate) followed b\ the addition of a 
biological sample suspected of containing a bacterial loxin. The biological .sample is first 
mixed with afllnity-purificd or l»EC; Iraciionaied antibodies directed against the recoiTibinanl 
toxin protein. A reporter reagent is then added which is capable of detecting ihe presence of 
antibody bound to the immobilized loxin protein. The reporter substance may comprise an 
antibody with binding specificity for the antitoxin attached to a molecule which is used to 
idenlily ihe presence of the reporter .substance. If loxin is present in ihe sample, ihis toxin 
will compete with the immobilized recombinant toxin protein for binding to the anii- 
recombinani antibody thereby reducing the signal obtained following the addition of ihe 
reporter reagent. A control is employetl where the antibody is not mixed with the sample. 
This gives the highest (or reference) .signal. 
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The invention also coniemplaies dcicclinu bacterial toxin by a "sandwich" 
immunoassay method that utilizes antibodies directed against recombinant bacterial toxin 
proteins. Affinity-purified antibodies directed against recombinant bacterial toxin proteins arc 
immobilized to a sohd support (c'.;^'.. microliter plates). Biological samples suspected of 

5 containing bacterial toxins are then added followed by a washing step to remove substantially 
all unbound antitoxin. The biological sample is next exposed to the reporter substance, which 
hinds 10 antitoxin and is then washed free of substantially all unbound reporter substance. 
The reporter substance may comprise an antibody with binding specificity for the antitoxin 
lUtachcd lo a molecule which is u.sed lo identify the presence of the reporter substance. 

10 Identillcaiion of the reporter substance in the biological tissue indicates the presence of the 
bacterial toxin. 

It is also conicmplaled that bacterial toxin be detected by pouring liquids {c.^., soups 
and lUiicr lluid foods and feeds including nutritional .supplements for humans and other 
animals) over immobilized antibody which is directed against the bacterial toxin. It is 

15 contemplated thai the immobilized antibody will be present in or on such supports as 

cartridges, columns, beads, or any other solid support medium. In one embodiment, following 
the cxpiKsurc of the liquid lo the immobilized antibody, unbound toxin is sub.sianiially 
removed by washing, fhe exposure of the liquid is then exposed to a reporter substance 
which delects the presence of bound toxin. In a preferred embodiment the reporter substance 

20 is an enzyme. Iluorescent dye. or radioactive compound attached to an antil^ody which is 
directed again.si the toxin (/.e.. in a "sandwich" immunoassay). It is also contemplated that 
the detection .system will be developed as necessary (c.e., the addition of en/yme substrate in 
enzyme systems: observation using fluorescent light i'or Iluorescent dye systems: and 
quantitation of radioactivity for radioactive systems). 

25 

KXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof 

In the disclosure which follows, the following abbreviations apply: "C (degrees 
Centigrade): rpm (revolutions per minute): BBS-Tween (borate buffered saline containing 
fween): B.SA (bovine serum albumin): KLISA (enzyme-linked immunosorbent assay): Cl-A 
(complete I-reund's adiu\'ant): IFA (incomplete I'reund s adjuvant): IgCi (immunoglobulin (J): 
IgY (immunoglobulin Y): IM (intramuscular): IP (intraperitoneal): IV (intravenous or 
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intravasculai ): SC (subcutaneous); 11,0 (water): HCl (hydrochloric acid); LD,„„ (lethal dose 
for 100% of experimental animals): aa (amino acid): HPLC (hiyh performance liquid 
chromatography): kD (kilodallons): «jm (grams): (micrograms); mg (milligrams): ng 
(nanograms): mI (microliters): ml (milliliters): mm (millimeters): nm (nanometers): 
(micrometer): M (molar): mM (millimolar); MW (molecular weight): sec (seconds): min(s) 
(minuie/minutcs): hns) (hour/hours): MgCi, (magnesium chloride): NaCI (sodium chloride); 
Na.CO, (sodium carbonate): (J0,,„ (optical density at 280 nm): 0D„„ (optical density at 600 
nm): PAGE (polyacrylamide gel electrophoresis): PBS [phosphate bulTercd saline (150 mM 
NaCI. 10 InM sodium phosphate buffer. pH 7.2)]: PEG (polyethylene glycol): PMSF 
(phenylmethyKsulfonyl Hiioride): SDS (sodiunn dodecyl sulfate): Tris 
(tris(h\droxymethyl)aminomcihane): r:n.surc(liii) (Kn-sureB;. Ro.ss Laboratories. Columbus OH): 
I'.nfamilK (l-nfamilK. Mead Johnson): w/v (weight to volume): v/v (volume to volume): 
Amicon (An>ic«>n. Inc.. Beverly. .MA): Amre.sco (Amresco. Inc.. Solon. Oil): AlCC 
(American Ixpe Culture Collection. Rockville. MD): BBl. (Baltimore Biologies Laboratory, 
(ii divisit.n of Bect(»n Dickin.son). C-ockeysvillc. MD): Becton Dickinson (Recton Dickinson 
l.abware. Lincoln Park. NJ): BioRad (BioRad. Richmond. CA): Biotech (C-C Biotech Corp.. 
Poway. C.A): Charles River (Charles River Laboratories. Wilmington. MA): Cocalico 
(Cociilico Biologieals Inc.. Rcam.stowii. PA): CyiUx (CyiR.s Corp.. Norcros.s. (}A): falcon 
(('..tj. Baxter Healthcare Corp.. McCiaw Park. IL and Becton Dickinson): FDA (I cdcral Food 
and Drug Administration): I ishcr Biotech (Fisher Biotech. .Sprinulield. N.I): C,IBC() (Cirand 
island Biologic Company/BRL. (irand Island. NY): C3ibco-BRL (Life Icchnologics. Inc., 
(lailherslnirg. MD): Harlan Sprauuc Dawlcy (Harlan .Sprague Dauicy. Inc.. Madi.son. Wl): 
Mallinckrodt (a division of Baxter Healthcare Corp,. McCiaw Park. IL): Millipore (Miliipore 
Corp.. Marlborough. MA): New l-:ngiand Biolabs (New Lngland Biolah.s. hic. Beverly. MA): 
Nosagcn (Novauen. Inc.. Madi.son. Wl): Pharmacia (Pharmacia. Inc.. Pi.scataway. NJ); (^iagen 
((,)iugcn. Chatsworth. CA): Sasco (Sascn. Omaha. NL): Showdex (Showa Denko America. 
Inc.. New York. NY): Sigma (.Sigma Chemical Co.. St. Louis. MO): Sterogcne (Stcrogene. 
Inc.. Arcadia. CA): lech Lab (Tech Lab. Inc.. Blacksburg. VA): and Vaxcell (Vaxcell. Inc.. 
a .sub.sidiar> ofCyiRX Corp.. Norcross. (iA). 

When a recombinant protein is described in the specification it is referred to in a 
-shori-hand manner by the amino acids in the toxin sequence present in the recombinant 
protem rounded to the nearest 10. For example, the recombinant protein pMB 1850-2360 
contains amino acids 1852 through 2362 of the C. diffkUc toxin B protein. The specification 
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gives detailed construction details for all recombinant proteins such that one skilled in the art 
will know precisely which amino acids are present in a given recombinant protein. 

EXAMPLE I 

5 Produclion Of Iligh-Tilcr Antibodies To Clostrniium Jifftcilc Organisms in A Hen 

Antibodies to certain pathogenic organisms have been shown to be effective in treating 
diseases caused by those organisms. It has not been shown whether antibodies can be raised, 
against Closiridiwn diffk'ile, which would be effective in treating infection by this organism. 
10 Accordingly. ('. cliffkHu was tested as immunogen for production of hen antibodies. 

To determine ihe best course for raising high-tilcr egg antibodies against whole ( 
clifjkilc t)rganisms. different immunizing strains and differem immunizing concentrations were 
examined. The example involved (a) preparation of the bacterial immunogen. 
(h) imnuinizalion. (c) purillcation of anti-bacterial chicken antibodies, and (d) detection of 
15 anti-bacterial antibodies in the puritled IgY preparations, 

a) Preparation Of Bacterial Immunogen 

( \ Jijficilc strains 435^M (scrogroup A) and 435^^6 (serogroup C) were originally 
obtained from the A'rCC\ These two strains were selected because they represent two of the 

20 most commonly-occurring serogroups isolated from patients with aniibioiic-ussociaicd 
p.seudomembranous colitis. [Delmee a aL. J. Clin. Microbiol.. 28( 10):221() ( I W0).| 
Additionally, both of these strains have been previously characterized w ith respect to their 
virulence in the Syrian ham.sier model for ('. difficile infection, (nelmee al., J. Med 
Microbiol.. 33:85 (1W0).| 

25 The bacterial strains were separately cultured on brain heart infusion agar for 48 hours 

at 37°r in a Gas Pack ICO Jar (BB!.. Cockcysvillc. MD) equipped with a Gas Pack Plus 
anaerobic envelope (BBL). I'orty-eighl hour cultures were used because they produce better 
growth and the organisms have been found to be niore cross-reactive with respect to their 
surface antigen presentation. The greater the degree of cross-reactivity ol'our IgY 

30 preparations, the better the probability of a broad range of activity against different 
strains/serogroiips. |Toma cv <//.. J. Clin. Microbiol.. 26(3):426 ( l^)88).| 

i he resulting organisms were removed from the agar surface using a sterile daeron-tip 
swab, and were suspended in a solution containing 0.4% formaldehyde in PBS. pi I 7.2, This 
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concentration of formaldehyde has been reported as producing good results for the purpose of 
preparing whnle-organism immunogen suspensions for the generation of polyclonal anii-C*. 
infywilv antisera in rabbits. [Delmee et ol.. J. Clin. Microbiol.. 21:323 (1985): Davies at ai. 
Microbial Path.. 9:141 (1990).] In this manner, two separate bacterial suspensions were 
prepared, one for each strain. The two suspensions were then incubated at 4**C for I hour, 
lollovving this period of formalin-treatment, the suspensions were centrifuged at 4.200 x g for 
20 min.. and the resulting pellet.s were washed twice in normal saline. The washed pellets, 
which contained formalin-treated whole organisms, were resuspcnded in fresh normal saline 
such that ihc visual turbidity of each suspension corresponded to a #7 McFarland standard. 
(M.A.C. Ildelsicin. ''Pravassm^ Clinical Spcxinwns for Anacrohic Bacteria: Isolation and 
hlauificuiion Procedures^ in S.M. Finegold et al (eds.),. Bailey and Scott 's Dia\inostic 
Micrahinlo^y. pp. 477-507. (\V. Mosby Co.. (1990). I ho preparation of McFarland 
ncphclomeicr standards and the corresponding approximate number of oriianisms for each 
lube arc described in detail at pp. 1 72-1 73 of this volume. | l lach of the two Ul suspensions 
was then split into two separate volumes. One volume ofeacli suspension was voiumetrically 
adjusted. b\ the addition of saline, lo correspond to the visual turbidity (if a U\ Mcl-ariand 
standard. The //I suspensions contained approximately 3 x lO'* organisms/ml. and the 

^7 suspensions contained approximately 2 x 10^' organisms/ml. \hL\ The four resulting 
conccniraiion-adjusted suspensions of Ibrmalin-ireaied C. difficile organisms were considered 
10 he "haclerinl imnninouen suspensions." These suspensions were used immediately after 
preparation for the initial immunization. |.SVc' section (b).j 

I hc rormalin-ireauiieni procedure did not result in 100% non-viable bacteria in the 
immunogen su.spensions. In order lo increase the level of killing, the formalin concentraiioh 
and icngih of ircaimeni were boih increased for subsequent immunogen preparations, as 
described below in Table 3. (Although viability was decreased with the stronger formalin 
ireaimeni. 100% inviabiliiy of the bacterial immunogen suspensions was not reached.) Also, 
in subsequent immunogen preparations, the formalin solutions were prepared in normal saline 
in.stcad of PUS. At day 49, the day of the fifth immunization, the excess volumes of the four 
previous bacterial immunogen suspensions were stored frozen al -lO'^C for use during all 
subsequent immunizations. 
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b) Immunization 

For the initial immunization. 1.0 ml volumes of each of the four bacterial immunogcn 
suspensions described above were separately emulsilled in 1.2 ml volumes ofCFA (GIBCO). 
For each of the lour emulsified immunogen suspensions. tv\o tour-month old While Leghorn 
."^ hens (pre-laying) were immunized. (It is not necessary to use prc-laying hens: actively-laying 
hens can also be utilized.) Each hen received a total volume of approximately 1.0 ml of a 
single emulsified immunogen suspension via four injections (two subcutaneous and two 
intramuscular) of approximately 250 |.il per site. In this manner, a total ol" four different 
immunization combinations, using two hens per combination, were initiated for the purpose of 
10 evaluating both the effect of immunizing concentration on egg yolk antibody (IgY) 

production, and inlerstrain cross-reactivity of IgY raised against heterologous sirain.s. I he 
four immunization groups are summarized in Table 3. 

TABLE 3 



Innnuni/niion Ciroups 



C'iroup Dcsiunation 


Immunizinii Strain 


ApproNimnic Immiini/.ins Dose 


CI) 43594. «l 


strain 43.W 


1.5 ^ Ml' ori;aiiisms.'hcii 


CD :.-7 


>( (* 


1 .(> * M) opjan isms, hen 


CD 43.S%. «| 


C di ff h iic 
strain 435% 


1.5 y 10' organisms, hen 


CD 435%. '.'7 


It 


1 .0 « 10' organisms.' lien 



The lime point for ihc first scries of immunizations was designated as "day zero," All 
subsequent immunizations were performed as described above except that the bacterial 
immunogen suspensions were emulsified using II'A ((JIBCO) instead oTCI-'A. and for the 
later lime point immunization, the stored frozen suspensions were used instead of freshly- 
prepared suspensions. The immunization schedule used is listed in Table 4. 
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TABLE 4 

Immunization Schedule 



Day or Immunizaiion 


Formalin-Treatment 


hnmuntigen Preparation Used 


0 


1%. 1 hr. 


freshly-prepared 


M 


1%. overnight 




2 1 


l"u. overnight 




35 


I"'... 48 hrs. 






1%. 72 hrs. 




70 




stored frozen 


«5 






105 







c) Purification Of Anti-Bacterial Chicken Antibodies 

(iroups dl four cges were collcclcd per immunizaiion yroup between days 80 and X4 
posl-iniiial immunizaiion. and chicken immunoglobulin (IgY) was cMracied aeeordinii lo a 
modiileaiioii ot* the procedure of A. Poison et al.. Immunol. Comm.. ^>:495 (1980). A gentle 
stream of distilled water from a squirt bottle was used to .separate the yolks IVom the whites, 
and the yolks were broken by dropping them through a funnel into a graduated cylinder. The 
lour indi\ idual yolks were pooled lor each group. The pooled, broken yolks were blended 
with 4 volumes of egg e.xtraciion bulTer to improve antibody yield (egg extraction buffer is 
0.01 M sodium phosphate. 0.1 M NaCl. pH 7.5. containing 0.005% thimerosal). and PiXi 
8000 ( Amre.sco) was added to a concentration of .1.5%. When all the Pl-G dissolved, the 
protein precipitates that formed were pelleted by centrilugation at l.'>.00() x g for 10 minutes. 
The supernatanis were decanted and filtered through chee.sccloth lo remove the lipid layer, 
and the 1^1 Xi was added to the supcrnalanis lo a final concentration of 12% (the supernatanis 
were a.ssumed to contain 3.5% PEG). After a second cenirifugaiion. the supernatanis were 
discarded and the pcllels were centrifuged a final lime to extrude the remaining Pl:(j. fhese 
crude IgY pellets were then dissolved in the original yolk volume of egg extraciion buffer and 
stored at 4°C, As an additional control, a preimmune IgV solution was prepared as described 
above, using eggs collected from unimmunized hens. 
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d) Detection Of Anti-Bacterial Antibodies In The Purified I^Y 
Preparations 

In order to evaluate the relative levels of specific ami-r. UifficHc activity in the lyY 
preparations described above, a modified version of the vvholc-orjzanisni ELISA procedure of 
5 N.V. Padhyc ei ai. 3. Clin. Microbiol. 29:99-103 (1990) was used. Frozen organisms of both 
( \ (lijjfk'ilc strains described above were thawed and diluted to a concentration of 
approximately I x ]()' organisms/ml using PBS. pfl 7.2, In this way. two separate coaling 
suspensions were prepared, one for each immunizing strain. Into the wells of 96-\vell 
microliter plates (l-alcon, Pro-Bind Assay Plates) were placed 100 volumes of the coaling 

10 suspensions. In this manner, each plaie well received a total of approximately I x 10" 

organisms of one strain or the other. The plates were then incubated at 4°C overnight. The 
next morning, the coaling suspensions were decanted, and all wells were washed three limes 
using PBS. In order to block non-specific binding sites. 100 iil ol'().5% BSA (Sigma) in IMiS 
was then added lo each well, and the plates were incubaled for 2 hours al room icmperauire. 

15 The blocking solution was decanted, and 100 |.il volumes ol" the lu^' preparations described 
above were iniiiaiiy diluted 1:500 with a .solution of 0.1% MSA in PBS. and then serially 
diluted in 1:5 steps. The following dilutions were placed in the wells: 1:500. 1:2.500. 
1:62.5000. 1 :." 12.500. and 1:1.562.500. The plates were again incubaled for 2 hours al room 
icmperauire. I'ollowing this incubaiion. the IgY-conlaining solutions were decanied. and the 

20 wells were washed three limes using BBS- Twecn (0.1 M boric acid. 0.025 M .sodium borate. 
1.0 M NaCI. 0.1% Tween.20). followed by two washes using PBS-Tween (I). 11.'. Twecn-20). 
and tlnally. two washes using PBS only, l o each well, 100 ^1 tif a 1:750 dilution of rabbit 
anti-chicken Ig(i ( whole-molecule)-alkaline pho.sphaia.se conjugate (Sigma) (diluted in 0.1% 
BSA in PBS) was added, fhe plaies were again incubated for 2 hours at room temperature. 

25 The conjugate .solutions were decanted and the plates were washed as described above, 
substituting 50 mM Na.CO,, pU 9.5 for the PBS in the final wa.sh. The plates were 
developed by ihe addition of 100 pl of a solution containing I mg/nil para-nitrophenyl 
phosphate (Sigma) dissolved in 50 mM NaXO;. 10 mM Mg(."l.. pi I 9.5 lo each well, and 
incubating the plates at room temperature in the dark for 45 minutes, fhe absorbance of each 

.lO well was measured at 410 nm using a Dynatcch MR 700 plale reader. In this manner, each of 
the four IgY preparations described above was tested for reacliviiy again.st both of the 
immunizing (' c/ifficilc strains: .strain-specific, as well as cros.s-reactive activity was 
determined. 



- 46 • 



wo 98/08540 PCT/US97/1S394 

Tabic 5 shows the results of the whole-organism ELISA. All four IgY preparations 
demonstrated significant levels of activity, to a dilution of 1:62.500 or greater against both of 
the immunizing organism strains. Therefore, antibodies raised against one strain were highly 
cross-reactive with the other strain, and vice versa. The immunizing concentration of 
organisms did not have a signilicant effect on organism-.specific IgY production, as both 
concentrations produced approximately equivalent responses. Therefore, the lower 
immunizing concentration of approximately 1.5 x 10'' organisms/hen is the preferred 
immuni/inu concentration of the t^^o tested. The prcimmune IgY preparation appeared to 
possess relaiivdy low levels of C. t/////V/7t-reactive activity to a dilution ol" 1:500. probably 
due to prior exposure of the animals to environmental Clostridia. 

•An initial whole-organism FLISA was performed using IgY preparations made from 
single C I) 4?5^)4. n\ and CD 4.-J596. #1 eggs collected around day 50 (data not shown). 
Specific liters were found to be 5 to 10-fold lower than those reported in Tabic 5. These 
results demonstrate that it is possible to begin immunizing hens prior to (he time that they 
begin to lay eggs, and to obtain high titer .specific IgY from the first eggs that are laid. In 
other words, it is mn nccc.s.sary to uait for the hens to begin laying before the immuniz;ition 
schedule is started. 
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TABLE 5 

ResuUs or The Anii-C diflkih Who le- Organ Lsm ELISA 



leY Pi-eparaiion 


Dilution Of IgY Prep 


43 594 -Coated Wells 


43596-Coaied Wells 






1 .7-16 


1.801 






1 Arfl 


1 .670 


C D 43594, v,\ 


I:I2..500 
1:62,500 


0.202 
0.136 


0.8 1 2 
0. 1 79 




l:3!2..S0O 


0.012 


0.080 




1:1.562.500 


0.002 


0.020 




1 ■ ^nn 


1 . /oil 


1 *n t 

1 .771 




1 . j..nuiJ 


1 /I*) C 

1 .UJ> 


1 .078 


CT) 435<)4. «7 


1:12.500 
);62.500 


0.1 88 

0.052 


0.382 
0.132 




1:312.500 


0.022 


0.043 




1:1,562.500 


o.oos 


0.024 




1 : ^00 


1 ..^2« 


1 .790 




1 i.t\t\ 
\ ..;.,^00 


0.}i.>_ 


1 .177 


(1) 435%. 'i\ 


1; I2.5(K) 

1:62.500 


0.247 

0.050 


0.452 
0.242 




1:312.500 


0.010 


0.067 




1:1.562.500 


0.000 


{)X)^(^ 




1 :50() 


1 .702 


1 .505 




1 :2.50() 


0.706 


O.K()6 


C*n 43506. ^.'7 


I:I2..500 

1:62.500 


0.250 

0.030 


0.282 

0.078 




1:312.500 


0,002 


0.017 




l:l..562..S00 


0.000 


0.0 to 




1:500 


0.142 


0.30^) 




1:2.500 


0.032 


0.077 


Prcitiinnint' lt:Y 


1:12..S00 


0.006 


0,024 


l:62..500 


0,002 


0.012 




I:3I2..>00 


0.004 


0.010 




l:l..VO..sn(^ 


0.002 


0.014 



10 EXAMPLE 2 

Trcatmenl 0\ i \ ilifJicHc Infcciion With Anii-C Jifj/kilc Antibody 

in order to dctcmiinc whctlicr the immune IgV antibodies rai.sed aLiain.si whole ('. 
iliffk'ilc organisms were capable of inhibiting the inleclion of hamsters by iliJficHc, 
\5 hanvsiers infected by ihe.sc bacteria were uiili/ed. |I.yerly ct ul.. Infect. Immun.. .^'^:2215- 
2218 ( I ).) This example involved: (a) determinaiion of the lethal dose of (.'. ilifjicilc 
organisms: and (b) treatment of infected animals with immune antibody or control antibody in 
nutritional solution. 
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SI) Determination Of The Lethal Dose Of C difficile Organisms 
Determination of tlie lethal dose of C. i/i(ficilc organisms was carried out according to 
the model described by D.M. Lyerly ei al.. Infect. Immun.. 59:2215-2218 (1991). c'. i/ifficile 
strain A TCC 4.1596 (scrogroup C. ATCC) was plated on BHI agar and grown anacrobically 
(BBI. Cias Pak 100 system) at 37''C lor 42 hour.s. Organisms were removed from the agar 
surface using a sterile dacron-lip swab and suspended in .sterile 0.9% NaCI solution to a 
density of 10" organisms/ml. 

In order to determine the lethal do.se of (.'. i/iffkUc in the presence of control amibody 
and nutritional formula, non-immune eggs were obtained from unimmunized hens and a 12% 
PEG preparation made as described in Example 1(c). This preparation was redissolvcd in one 
fourth the original yolk volume of vanilla llavor Ensure*. 

Starting on da\ one. groups of female Golden Syrian hamsters (Marian Sprague 
Dawlcy). 8-9 weeks old and weighing appro.\imaiely 100 gm. were orail> administered 1 ml 
of the preimmunc.'l-nsureK formula at time zero. 2 hour.s. 6 hours. ;md 10 hours, .At I liour. 
animals were orally administered 3.0 mg clindamycin HCI (Sigma) in I nil of water. This 
drug predispo.ses hamsters to ( •. (lifficih' infection by altering the normal intestinal tlora. On 
day two. the animals were given 1 ml of the preimmune IgY/Ensure*: formula at time zero. 2 
hours. 6 imurs. and 10 hours. Al 1 hour on day two. dilfcrcnt groups of animals were 
inoculated orally with saline (control), or lO'. W\ 10^ or I0« C </////c/7c' (.rganisms in I ml 
ol saline, i rom da\ .s .1-12. animals were given I ml of the preimmune IgY/linsure n formula 
three times duil\' and observed for the onset ol' diarrhea and death. Laeh animal was housed 
HI an uuliv idual cage and was offered food and water cul liMmni. 

Administration of 10" - 10" organisms resulted in death in 1-4 days while the lower 
doses o|- !()■ - 10^ organisms caused death in 5 days. Cecal swabs taken from dead animals 
indicated tiie presence olT. diflk ih. (iiven the effectiveness of the 10- do.se. this number of 
organisms was chosen for the following experiment to see if hyperimmune anli-C. difficUc 
antibody could block infection. 

b) Treatment Of Infected Animals With Immune Antibody Or 
Control Antibody In Nutritional Formula 

The experiment in (a) was repeated using three groups of .seven hamsters each. (Jroup 
A receised no. clindamycin or C. ilitfidle and was the .sur\ ival control. Group B received 
clindamycin. 10- C. ilifjfkile organisms and preimmune IgY on the .same .schedule as the 
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aiiiniais in (a) above. Group C received clindamycin. !()■ ('. c/iffk'ih organisms, and 
hyperimmune anti-C difficile IgY on the .same schedule as (iroup B. The anti-C difficile IgY 
was prepared as described in Example i except thai the 12% FM-G preparation was dis.S()lved 
in one Tourlh the original yolk volume of tnsure^K;, 

All animals were observed for the onset of diarrhea or oiher disease symptoms and 
death. Each animal was housed in an individual cage and was olTcred food and water ad 
iihiitmh The results are shown in Table 6. 



TABLE 6 

ri)c E-lTect OfOrnl Fecdinu Of Hyperimmune luY Antibody on (* JUficiU' Infcciion 



Animal Group 


Time To Diarrhea*' 


Time To Deaih"* 


A 


pre- immune luY only 


no diarrhea 


no deaths 




C'lindnmyciii. liiffk-Uc. preimmune IjJiY 


M) hrs. 


4*) lirs. 


C 


C limianiycin. C liifjfkik*. immune Ii»Y 


.13 hrs. 


56 hrs. 



15 Mean of seven animals. 

I Uimstcrs in the control group A did not develop diarrhea and remained healthy during 

the cxpcrimeniai period, llamsiers in groups 13 and C developed diarrheal disease. Anii-(.'. 

difficile IgY did not protect the animals from diarrhea or death, all animals succumbed in the 

same time interval as the animals treated with preimmune IgY. Thus, while immuni/ation 
20 with whole organisms apparently can improve sub-lethal symptoms with particular bacteria 

(see U.S. Patent No. 5.080.895 to 11. Tokoro). such an approach does not prove lo be 

productive lo protect against the lethal effects of ( ' difficile. 

EXAMPLE 3 

25 Production of hoiulinum Type A Antitoxin in Mens 

In order to determine whether antibodies could be raised against the toxin produced by 
clostridial pathogens, which would lie effective in treating clostridial diseases, antitoxin to (' 
/wttilimim type A toxin was produced. This example involves: la) toxin modilication: (b) 
30 immunization: (c) antitoxin collection: (d) antigenicity as.sessmeni: and (e) ass;iy of antitoxin 
tiler. 



- 50 - 



WO98AttS40 



PCT/US97/I5394 



10 



15 



») Toxin Modification 

C. hoiulinum type A toxoid was obtained from B. R. DasGupta. From this, the active 
type A neurotoxin (M.W. approximately 150 kD) was purified to greater than 99% purity. 
iim)rdiny to published methods. f B.R. DasGupta & V. Sathyamoorihy. Toxicoii. 22:415 
(1984).] The neurotoxin was detoxified with formaldehyde according to published methods. 
|U.R. .Singii & B.R. DasiCiupta. Toxicon. 27:40.'? (1989).) 

i>) Immunization 

C. hoiiilimim toxoid lor immunization was dissolved in PBS (I nig/ml) and was 
cmulsilieJ with an approximatciv equal volume ofCFA (GIBCO) for initial immunization or 
li-A lor boo.sier immuniz:iti«n. On day zero, two white leyhorn hen.s. obtained iVom local 
breeders, were each injected at multiple sites (intramuscular and subcutaneous) uith I nil 
inactivated toxoid emuLsilled in I ml CFA. Subsequent booster immunizations ucre made 
according to the lollowinii .schedule Ibr day of injection and loxoid amount: clays 14 and 21 - 
0.5 my: day 171 - 0.75 mg: days 394. 401. 409 - 0.25 mu. One hen received an additional 
bt)().sier of 0.1 50 mu on dav 544. 



c) Antitoxin Collection 

Total yolk immunoglobulin (IgY) was extracted as described in r.\ample 1(c) and the 
20 IjtY pellet was di.ssolved in the original yolk volume of PBS wiih lliimerosal. 

d) .Vntiecnicity Asse.s.smcnt 

i:ggs were collected from day 409 through day 423 to assess whether the toxoid was 
siilllciently immunogenic to rai.se antibody, l^ggs from the two hens were pooled and 

25 antibod\ was collected as described in the standard PHG protocol. |TIxample Kc).) 

Antigenicity of the botulinal toxin was a.s.sesscd on Western blots, [he 150 kD detoxified 
type A neurotoxin and unmodified, toxic. 300 kD botulinal type A complex (toxin u.sed for 
intragastric route administration Ibr animal gut neutralization experiments: see I-xample 6) 
were separated on a SDS-polyacrylamide reducing gel. The Western blot technique was 

30 performed according to the method of Towbin. fll. Towbin et ciL Proc. Natl, Acad. Sci. 

I'SA. 76:4350 (1979). I Ten ^g samples of ('. hoiulinum complex and toxoid were dis.solvcd 
in SDS reducing sample buffer (1% SDS. 0.5% 2-mcrcaptoethanol. 50 mM Tris. pi I 6.8. 10% 
glycerol. 0.025% wA- bromphenol blue. 10% P-mereaptoethanol). heated at 95''C lor 10 min 
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and separated on a I mm ihick 5% SDS-polyacr\'lamide gel. |K. Weber and M. 
Oshorn/'Proicins and Sodium Dodecyl Sulfate: Molecular Wei^hi Deiermination on 
Polyacrylamule dels and Related Pmcvdures:' in The Proteins. 3d Edition (IK Neurath & 
R.L. Hill. cds). pp. 179-223, (Academic Press. NY. 1975).| Pari of ihc uel was cut olT and 
> ihe proteins were stained with Coomassic Blue. The proteins in the remainder of the gel were 
transferred to nitrocellulose using the Milliblot-SDI: electro-blotiing system (Miliipore) 
according to manufacturer's directions. The nitrocellulose was temporarily stained with 10% 
Ponceau S fS.B. Carroll and A, i-aughon. "Produciion and Purification of Polyclonal 
Antibodies to the hoiei^^n Se^ttient of 0-^alactosidase Fusion Proteins" in DNA C'toniny: .1 

1(3 Practical Approach. Vol.111. (D. Glover, ed.), pp. 89-1 1 1. IRL Press. O.Ntbrd. ( I987)J to 

visualize the lanes, then destaincd by running a gentle stream of distilled water over the bloi 
lor .several minutes. I'lic nitrocellulose was immersed in PBS containing 3% BSA overnight 
at 4°C lo block any remaining protein binding sites. 

The blot was cut into strips and each strip was incubated with the appropriate primary 

15 antibod). i he avian anli-(* hotulinum antibodies (described in (c)] and prc-immune chicken 
antibody (as control) were diluted 1:125 in PBS containing I mg/ml BSA lor 2 hours at room 
temperature. I he blots were washed with two changes each of large \ohtmes of PBS. BBS- 
Tween and PBS. successively (10 min/wash). Goat anti-chicken IgCi alkaline phosphatase 
conjugated .secondary antibody (l-isher Biotech) was diluted ];5()() in PBS containing 1 mg/ml 

20 BSA and incubated with the blot for 2 hours at room icmperalurc. The blots were washed 

w ith two changes each of large volumes of PBS and BBS- I'ween. I'ollowed by one change of 
PBS and 0.1 M Tris-MC.T pM 9.5, Blots were developed in freshly prepared alkaline 
phosphatase substrate buffer (100 ^ig/ml nilroblue letrazolium (Sigma). 50 |.ig/ml 5-bromo-4- 
chloro-3-indoIyl phosphate (Sigma). 5 mM MgCI> in 50 mM Na^C'O.. pH 9.5). 

-5 The Western blots are shown in Tigurc I. The anti-T fiofulinum IgY reacted to the 

toxoid to give a broad imnninoreaclive band at about 145-150 kD on the reducing gel. This 
toxoid is refractive to disulfide cleavage by reducing agents due to formalin crosslinking. The 
immune IgY reacted with the active toxin complex, a 97 kD ( hotulimun type A heavy chain 
and a 53 kD light chain. The preimmune IgY \Nas unreactive lo the ( hotulinum complex or 

30 toxoid in the Western blot. 
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c) Antitoxin Antibody Titer 

The ly Y antibody titer to C. hoiulinum type A toxoid of ejjgs harvested between day 
4(W and 423 was t-valuateti by IXISA. prepared as follows. Nineiy-six-well Falcon Pro-bind 
plates were coaled overnight at 4''C with 100 mI/wcII toxoid fB.R. Singh & B.R. Das Gupta. 
Toxicon 27:40.1 ( 1989) | at 2.5 Mg/ml in PBS. pH 7.5 containing 0.00.5% thimerosal. The 
lolltiwing day the wells were blocked with PBS containing 1% BSA lor 1 hour at .■57°C. The 
IgY from immune or preimmunc eggs was diluted in PBS containing 1% BSA and 0,05% 
Twcen 20 and the plates were incubated for 1 hour at 37»C. 1 ho plates were washed three 
times u iih PBS containing 0.0.5% Tween 20 and three times with PBS alone. Alkaline 
phosphalasc-conjugated goat-anti-chickcn IgO (Fisher Biotech) was diluted 1:750 in PBS 
containing 1% BSA and 0.05% Tween 20. added to the plates, and incubated 1 hour at j7°C. 
The plates were washed as lierorc. and p-nitrophenyl phosphate (Sigma) at 1 mg/ml in 0.05 M 
Na.CO,. pl l 9.5. 10 ni.M MgCU was added. 

The results are .shown in I'igurc 2. C hickens immunized with the toxoid generated 
high titers of antibody to the imnuinogen. Importantly, eggs from both immunized hens had 
signilicani anii-immunogen antibody tilers as compared to preimmune control eggs. The anti- 
(■ hniiilimim IgV pt)sse.s.sed significant activity, to a dilution of 1:91.750 or greater. 

EXAMPLE 4 

Preparation Of Avian ligg Yolk Immunoglobulin In An Orally Administrable l orm 

In t>rdcr to admini.ster avian IgY antibodies orally to e.xpcrimenial mice, an effective 
dcli\cr\ formula f^r the IgY had to be determined. The concern was that if the crude IgY 
was dissoKcd in PBS. the saline in PBS would dehydrate the mice, which might prove 
luirmfiil o\cr the duration o[' the study. Therefore, alternative methods of oral administration 
of IgY were tested. The example involved: (a) isola-lion of immune IgY: (b) solubilization 
of IgY in water or PBS. including subsequent dialysis of the IgY-PBS solution with water to 
eliminate or reduce the .salts (salt and phosphate) in the buffer: and (c) eompari.son of the 
quantity and activity of recovered IgY by absorbance at 280 nm and PA(il:. and enzyme- 
linked ininuinoassay (ELISA). 
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a) Isolation Of Immune IgY 

In order to investigate the most effective delivery formula for IgY. wc used IgY which 
was raised against Ciotalus clunssiis fem/icus venom. I hrce eggs were collected from hens 
mimunized with the C\ Jurissus tcmfkus venom and IgY was extracted from the yolks using 
the modi lied Poison procedure described by Thalley and Carroll IBio/ I cchnology, «:9.14.938 
(1 990) I as described in Example 1(c). 

The egg yolks were separated from the whites, pooled, and blended uiih four volumes 
of PBS, I»owdered PEG 8000 was added to a concentration ol' 3.5%. The mixture was 
centrifuged at 10.000 rpm for 10 minutes to pellet the precipitated protein, and the 
supernatant was liliered through cheesecloth to remove the lipid la\er. Powdered PIXJ 8000 
was added to the supernatam to bring the fmal PEG concentration to 12% (assuming a PEG 
concentration of 3.5% in the supernaiani). The 12% PEG/lyY mixture was divided into two 
equal volumes and centrifuged to pellet the IgV, 

h) Solubilization Of The I^Y f n Water Or PBS 

One pcllel was rcsuspended in 1/2 the original yolk Nolumc of PBS. and the other 
pellet was rcsuspended in 1/2 the original yolk volume of water. The pellets were then 
cenirifuLjed to remoxe any panicles or insoluble material. The IgV in PBS solution di.ssolved 
readily but ihc fraction rcsuspended in water remained cloudy. 

In order to satisfy anticipated sterility requiremenis for orally administered aniihodies. 
the aniibod) solution needs to he filler-sterilized (as an aliernalive to heat .sterilization which 
would desiro> the antibodies). I he preparation of kY rcsuspended in water was too cloudy 
to pass through either a 0.2 or 0.45 Mm membrane filter, .so 10 ml of the PBS rcsuspended 
Iraciion was dialyzed overnight at ro(mi temperature against 250 nil t)f water, fhe lollowinu 
morning the dialysis chamber was emptied and relllled with 250 ml of fresh II.O for a second 
dialysis. Thereafter, the yields ol soluble antibody were determined at OD,,,, and are 
compared in fable 7, 
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TABLE 7 



Fraction 


Absorbance Of 1:10 Dilution At 280 nm 


Percent Recover)' 


l»BS dissolved 


M49 


lOOVo 


II.O dissolved 


n.706 


61% 


PBS dissolvcd;'H:0 dialyzed 


0.88.S 


77% 



Kesuspenclinu the pellets in PBS followed by dialysis against water rceovercd more 
Miitiboiiy than direetly resuspending the pellets in water (77% versus 61%). liquivalcnl 
v<>lunie.s o( ilic ly Y preparation in PBS or water were compared by PAGK. and these results 
were in accordance with the absorbance values (data not shown). 

c) .Activity Of IgY Prepared With Different Solvents 

An I^I.ISA was performed to compare the binding acti\ ity of the IgY extracted by 
each procedure described above. C. i/uris.sm- larifkus (CU/./. ) venom at 2.5 ^ypm\ in PBS 
was u.scd to coat each well ol a %-well microtiter plate. The remaining protein binding sites 
were bl.Kked uith PBS containing 5 mg/ml BSA. Primary antibody dilutions (in PBS 
containing I mg/ml BSA) were added in duplicate. After 2 hours of incubation at room 
temperature, the unbound primar> antibodies were removed by washing the wells with I'BS. 
B^^S-T\vecn. and PBS. The species s|X'cillc secondary antibody (goat anti-chicken 
immun.>glohulin alkaline-phosphatase conjugate (Sigma) was diluted 1:750 in PBS containing 
I mg/ml BSA and added to each well of the microtiter plate. After 2 hours of incubation at^ 
room temperature, the unbound secondary antibody was removed by u jshing ihc plate as 
before, and freshly prepared alkaline phosphatase substrate (Sigma) at 1 mg/nil in 50 miM 
Na,CO,. I() mM MgCI,. pll 9.5 was added to each well. The color development was 
measured on a Dynateeh MR 700 microplate reader using a 412 nm filter. The results are 
shown in Table 8. 

The binding assay results parallel the recovery values in Table 7. with PBS-dis.sc)lved 
IgY .showing slightly more activity than the PBS-di.ssolved/l l,0 dialy/ed antibod> . The 
\w.ter-dissohed-antibi>dy had considerably le.ss binding activity than the other preparations. 
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EXAMPLE 5 

Survival Of Antibody Acliviiy After Passage Through The (iastroinicstinal Tract 

hi order to determine the feasibility of oral administration of antibody, it was of 
5 interest lo determine whetiier orally administered IgY survived passage through the 

gastrointestinal tract. The example involved: (a) oral administration of specific immune 
antibody mixed with a nutritional formula: and (b) assay of antibody acliviiy cxiracicd from 
feces. 

TABLE 8 

H) Antiiicn-Htndine Activity Of IgY Prepared With DilTcrcnt Solvents 



I')iluti()n 


Preimmunc 


PBS Dissolved 


II.O Dissolved 


PBS/M.C) 


1:500 


0.005 


1.748 


1.577 


1.742 


l:2.5()() 


0.004 


0.(>44 


0..)4<) 


0.606 


1 .12. 500 


0.001 


0.144 


0.054 


(1.000 


1:62. .*?00 


0.001 


0.025 


0.007 


0.0 10 


1:312.500 


0.010 


0.000 


0.000 


0.002 



a) Oral Administration Of Antibody 

fhe IgY preparations used in this example are the same PUS-dissolved/l-KO dialyzcd 
20 anlivenom materials obtained in Kxample 4 above, mixed with an equal volume of F.nfamil*-. 
Two mice were used in this experiment, each receiving a different diet as follows: 

1 ) water and food as usual: 

2) immune IgY preparation dialyzcd against water and mixed 1:1 with l:nfamil*4. 
( fhe mice were given the corresponding mixture as their only source of food and water). 

25 

h) Antibody Activity After Ingestion 

Aikr both mice had ingested their rcspcclive lluids. each lube was rcllHed with 
approximalcly 10 ml of ihc appropriate lluid llrsl thing in ihc morning. l.^y mid-morning 
there was aboul 4 to 5 ml of liquid left in each tube. At this point slool .satnplcs were 
.'^O collected from each mouse, weighed, and dissolved in approximately 500 PBS per 100 mg 
slool sample. One hundred and sixty mg of control stools (no antibody) and W mg of 
experimental stools (specific antibody) in 1.3 ml microfuge tubes were dissolved in 800 and 
500 pi PBS. respectively. The samples were heated at 37%' ibr 10 minutes and vortexed 
vigorously. The experimenlal stools were aLso broken up with a narrow spaiula. Each .sample 
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was centrifuged for 5 minutes in a microfuge and the supernatants, presumably containing the 
antibody extracts, were collected. The pellets were saved at 2-8^C in case future extracts 
were needed. Because the supernatants were tinted, they were diluted five-fold in PBS 
containing 1 mg/ml BSA for the initial dilution in the enzyme immunoassay (RLISA). The 
primary extracts were then diluted five-fold serially from this initial dilution, fhe volume of 
primnrx- extract added to each well was 190 ^1. The ELISA was performed exactly as 
described in Example 4. 

TABLE 9 



10 



Dilution 


Preimmune IgV 


Control Fecal Exiracc 


tXP. Fecnl tx tract 


1:5 


0 


(um 


0.032 


1:25 


0.016 


• 0 


0.016 


1:125 


- 0 


' 0 


0.009 


1:625 


n 


0.003 


0.001 


i:.ll25 1 -0 


0 


0.000 



15 



20 



There vNiis some active ainibod\ in the fecal extract iVoni the mouse given the specific 
untibiHiy in finCamilJl lormula. but it was present at a very low level. Since the samples were 
assayed at an initial 1:5 dilution, the binding observed could have been hiyher with less dilute 
samples. ( ■«>nsequently. the mice were allowed to continue ingesting cither regular food and 
water or the specific IgY in Enfamil*. Jbrmula. as appropriate, so the as.say could be repeated. 
Another i:i.lSA plate was coated overnight with 5 pg/ml ol' C.J.i. venom in PB.S. 

I he lol lowing morning the liLISA plate was blocked with 5 mg/ml BSA. and the local 
.samples were extracted as before, except that in.stead of heating the extracts at .■;7''C. the 
.samples were kept on ice to limit proteolysis. The samples were a.ssaycd undiluted initialK . 
and in 5X serial dilutions thereafter. Otherwise the assay v\as carried out as before. 
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TABLE 10 

Specific Antibody Survives Passage Through The Gasiroiniesiinal Traci 



Dilution 


Preimmune luY 


Conirol Rxiraci 


txp. Extract 


tindiluied 


0.003 


0 


0.379 


1:5 


• 0 


• 0 


0.071 


1 25 


0.000 


0 


0.027 


1:125 


0.00.1 


■ 0 


0.017 


1:625 


o.ooo 


0 


0.008 


1:3125 


0.002 


0 


0.002 



The experiment confirmed the previous results, with the antibody aeiiviiN markedly 
higher. The control tccal extract showed no nnU-Cdf. activity, even undiluted, while the 
lecal extract from the am\-C\t/j. lyY/IZnramiitfe-l'ed mouse showed considerable mu-Cdr 
activity. Fhis experiment (and the previous expcrimenl) clearly demonstrate that active IgY 
antibody survives passage through the mouse digestive tract, a llnding with la\orable 
implications for the success of IgY antibodies administered orally as a therapeutic or 
prophylactic. 

EXAMPLE 6 

In F/iYi Neutralization Of Type hnfnlinum 
l ypc A Neurotoxin By Avian Antitoxin Antibody 

I'his example demonstrated the ability of PHG-purificd antitoxin, collected as 
described in Hxampie 3. to neutralize the lethal elTcct olT. hoiulinuni neurotoxin type A in 
mice. To determine the oral lethal dose (LDjoo) of toxin A. gioups of BALD/c mice were 
given dilTerenl doses of toxin per unit body weight (average body weight of 24 grams). I or 
oral administration, toxin A complex, which contains the neurotoxin associated w iih other 
non-toxin proteins was used. This complex is markedly more toxic than purified neurotoxin 
when given by the oral route, |l. Ohishi ct ai., hifcci. Immun.. I6:I()() (P)??).) ( \ hotitlimmf 
toxin type A cimiplex, obtained from l-ric Johnson (Universiiy Of Wisconsin. Madison) was 
250 ^ig/ml in 50 niM sodium ciiraic. pi I 5.5. specific loNicity 3 x 10^ mouse LD5(,/mg with 
parenteral administration, Approximaicly 40-50 ng/gm body weight was usually fatal within 
48 hours in mice maintained on conventional food and water. When mice were given a diet 
ad libitum of only Enfamil(R) the concentration needed to produce lethality was approximately 
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2.5 limes higher (125 ng/gm body weight). Botulinal toxin concentrations of approximately 
200 ng/gm body weight were fatal in mice fed Enfamil® containing preimmune IgY 
(resuspcndeU in Enfamilig) at the original yolk volume). 

The oral LD,,^, off. ImniUnum loxin wa.s also determined in mice that received 
known amount.s of a mixture of preimmune IgY-Ensure<B) delivered orally through feeding 
needles. Using a 22 gauge feeding needle, mice were given 250 pi each of a preimmune 
IgY-CnsurcH mixture (preimmune IgY dissolved in 1/4 original yolk volume) 1 hour before 
and I, '2 hour and 5 hours after administering botulinal toxin. Toxin conceniration.s given 
orally ranged from approximately 12 to .112 ng/gm body weight (()..! to 7.5 Mg per mouse). 
Hotulinal toxin complex concentration of approximately 40 ng/gm body weight ( I ^.g per 
niou.sc) was lethal in all mice in less than 36 hours. 

I wo groups of BALB/c mice. 10 per group, were each given orall\ a single dose of I 
ug oiich of botulinal loxin complex in 100 ^1 of 50 mM sodium ciirate pH 5.5. The mice 
received 250 )\\ treatments of a mixture of either preimmune or immune IgY in F.nsurcK^ (1/4 
original \olk \-olume) I hour before and 1/2 hour. 4 hours, and 8 hours after botulinal toxin 
admini.stratii>n. The mice received three ircamicnts per day for two more da\s. The mice 
were obser\ed for % hours. The survival and mortality are shown in fable 1 1 . 

TABLE II 



To.sill l^o.sc Hi! gni 


Antibody Type 


Number Of Mice Alive 


Number Of Mice Dead 


■4 1 .6 


iH)n-iniiTU!ne 


0 


10 


-ll.(> 


iinii-botultnnl toxin 


10 


0 



All mice ircaled with the preimmune IgY-Fnsurev.K mixture died uiiUin 46 liours post- 
toxin adminisiralion. The average lime of death in the mice was 32 hours post loxin 
adminisiraiion. Treaimcnis o\' preimmune luY-EnsurcCH; mixture did not continue beyond 24 
hours due lo extensive paralysis of ihe mouth in mice of this group. In conirasi. all ten mice 
ircaied with the immune anti-boiulinal toxin lgY-Ensure(!l^ mixture survived past 96 hours. 
Only 4 mice in this group exhibited symptoms of botulism toxicity (two mice about 2 days 
after and two niice 4 days after toxin administration). These mice eventually died 5 and 6 
days later. Six of the mice in this immune group displayed no adverse effects lo the loxin 
and remained alive and healthy long term. Thus, the avian anti-botuiinal toxin antibody 
demonstrated wry good protection from ihe lethal elTecls of the toxin in the experimental 
mice. 
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EXAMPLE 7 

Prodiiciion Of An Avian Antitoxin Againsi Closirulium difficile Toxin A 

Toxin A is a polent cyioioxin secreted by pathogenic strains of ( \ dijricik\ that plays a 
5 direct role in damaging gastrointestinal tissues. In more severe cases of C cliJJkHc 

intoxication, pseudomembranous colitis can develop which may he fatal. This would be 
prevented by neutralizing the effects of this toxin in the gastrointestinal tract. As a first step, 
antibodies were produced againsi a portion of the toxin. The example involved: (a) 
conjugation of a synthetic peptide of toxin A to bovine serum albumin: (b) immunization of 
10 hens with the pcptide-BSA conjugate: and (c) detection of antitoxin peptide antibodies by 
□.ISA. 

ii) Conjugation Of A Synthetic Peptide Of Toxin A To Bovine 
Scrum Albumin 

15 The .synihctic peptide CQ TlDGKKYYrN-NH, (SI;Q II) N():82) was prepared 

commercially (Multiple Peptide Systems. San Diego. CA) and validated lo he pure by 

hieh-pressure liquid chromatograpliy. The eleven amino acids following the cysteine residue 
represent j consensus sequence of a repealed amino acid sequence found in loxin A. (Wren 
ci iti, infect. Immun.. 59:3151-3155 (I991).| The cysteine was added to facilitate 

20 conjugalion lo carrier protein. 

In order to prepare the carrier for conjugalion. BSA (Sigma) was dissolved in 0.01 M 
NaPO,. pi I 7.0 to a llnal conceniraiion of 20 mg/ml and n-maleimidoben/.oyl-N- 
hydroxysuccininiide ester (MBS: Pierce) was dissolved in N.N-climethyl formamide to n 
concentration of 5 mg/ml. MBS solution. 0.51 ml. was added to 3.25 ml of the 13SA solution 

25 and incubated for 30 minutes at room temperature with stirring every 5 minutes. The MBS- 
aclivated BSA was then purified by chromatography on a Bio-Ciel P-IO column (Bio-Rad: 40 
ml bed volume) equilibrated with 50 mM NaPOj. pi I 7.0 buffer. Peak fractions were pooled 
(6,0 ml). 

Lyophilized toxin A peptide (20 mg) was added to the activated BSA mixture, .stirred 
30 until the peptide dis.solvcd and incubated 3 hours at room temperature. Within 20 minutes, 
the reaction mixture became cloudy and precipitates formetl. After 3 hours, the reaction 
mixture was centrifuged at 10.000 x g for 10 min and the supernatant analyzed for protein 
content. No significant protein could be detected at 280 nm. The conjugate precipitate was 
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washed three times with PBS and stored at 4''C. A second conjugation was performed with 
15 mg of activated BSA and 5 mg of peptide and the conjugates pooled and suspended at a 
peptide concentration of 10 mg/ml in 10 mM NaPO^. pH 7.2. 

•>) immunization Of Hens With Peptide Conjugate 

1 wo hens were each initially immunized on day zero by injection into luo 
subcutaneous and two intramuscular sites with I mg of peptide conjugate that uas emulsified 
in CF.A (GIBCO). The hens were boosted on day 14 and day 21 with I mg of peptide 
conjugate emulsilled in IFA (GIBCO). 

c) Detection Of Antitoxin Peptide Antibodies By ELISA 

IgY was puritied from two eggs obtained before immunization (pre-inimune) and two 
eggs obtained .ll and .-52 days after the initial immunization using PEG fractionation as 
described in lixample I. 

Wells of a Q6-well microtiter plate (1-alcon Pro-Bind Assay Plate) were coated 
overnight at 4°C' with 100 ng/ml solution of the toxin A synthetic peptide in PBS. pH 7.2 
prepared by dissolving I mg of the peptide in 1.0 ml of H.O and dilution of PFiS. The pre- 
inimune and immune IgY preparations were diluted in a live-fold series in a huffcr containing 
1% PI-:G 8000 and 0.1% Tween-20 (v/v) in PB.S. pH 7.2. The wells were blocked for 2 
hours at room temperature with 150 mI of a solution containing 5% (Wv) Camation.K. nonfat 
dry milk and 1% PEG 8000 in PBS. pH 7.2. After incubation for 2 hours at room 
temperature, the wells were washed, secondary rabbit anti-chicken IgG-alkaiine pho.sphatasc 
( l:7.S0) added, the wells cashed again and the color dexeiopment obtained as described in 
i;.\aniplc 1. I he results are shown in Table 12. 

TABLE 12 



Dilution Of PtG Prep 


Absorbancc At 410 nm 


Prcinimune 


Immune Anii-Peptidc 


1:100 


0.013 


0,253 


1:500 


0.004 


0,030 


1:2500 


0.004 


0.005 



Clearly, the immune amibodies contain titers against this repeated epitope of toxin A. 
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EXAMPLE 8 

Production Of Avian Antitoxins Against 
Clostridium difficHe Native Toxins A And B 

5 To determine whether avion antibodies are elTective Ibr the neutralization of C 

diflkik' toxins, hens were immunized using native C. difftcilc toxins A and B. The resulting 
egg yolk antibodies were then extracted and assessed for their ability to neutralize toxins A 
and H in ritm. The Hxamplc involved (a) preparation of the toxin imnninouens. (b) 
immunization, (c) purification of the antitoxins, and (d) assay of toxin neutralization activity. 

10 

a) Preparation Of The Toxin Immuno^ens 

Both C. difficile native toxins A and B. and (*. Ji/ficile toxoids, prepared by the 
treatment of the native toxins with formaldehyde, \\ere employed as immunouens. ( \ difficile 
toxoids A Lind B were prepared by a procedure whicli was modillcd from published methods 

15 (l£hrich cf oL. Infect. Immun. 28:1041 (1980). Separate solutions (in PBS) of native (' 

diffiu'ilc toxin A and toxin B (Tech Lab) were each adjusted lo a concentration of 0.20 mg/ml, 
and formaldehyde was added to a fmal concentration of 0.4%. fhc toxin/formaldehyde 
si>lutions were then incubated ai 37*^0' for 40 hrs. free formaldehyde was then removed from 
the resulting toxoid solutions by dialysis against PBS at 4°C. In previously published reports, 

20 this dialysis .step was not performed. Therefore, free formaldehyde must have been present in 
their toxoid preparations. The toxoid solutions were concentrated, usinc a C'entriprep 
concentrator unit (Amicon). ti> a final toxoid coneentration t>f 4.0 mg/ml. I he two resulting 
preparations were designated as toxoid A and toxoid B. 

r. difficile native toxins were prepared by concentrating slock .solutions of toxin A and 

25 toxin B (Tech Lab. Inc), using Centriprcp concentrator units ( Amicon). to a final 
concentration of 4.0 mg/ml. 

b) Immunizsition 

fhc first two immunizations were performed using the toxoid A and toxoid B 
30 immunogens described above. A total of 3 different immunization combinations were 

employed, for the first immunization group. 0.2 ml of toxoid A was emulsified in an equal 
volume of Tiler Max adjuvant (CytRx). Titer Max was used in order to conserve the amount 
of immunogen used, and to simplify the immunization procedure. This immunization group 
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was designated "CTA." For the second immunization group. 0.1 ml of toxoid B was 
emulsified in an equal volume of Titer Max adjuvant. This group was designated "CTB." 
For the third immunization group. 0,2 ml of toxoid A was Urst mixed with 0.2 ml of toxoid 
n. and the resulting mixture was emulsified in 0.4 ml of Titer Max adjuvant. This group was 
designated "CTAB." In this way. three separate immunogcn emulsions were prepared, with 
each emulsion containing a final concentration of 2.0 mg/ml of toxoid A (CTA) or toxoid B 
(CTB) or a mixture of 2.0 mg/ml toxoid A and 2.0 mg/ml toxoid B (C I AB). 

On day 0. White Leghorn hens, obtained from a local breeder, were immunized as 
follows: (iroup CTA. Four hens were immunized, with each hen receiving 200Mg of toxoid 
A. via two intramuscular (I.M.) injections of 50^1 of CTA emulsion in the brea.st area. 
(Jroup CTB. One hen was immunized with 200Mg of toxoid B. via two I.M. injections of 
5()mI of CTB emulsion in the breast area. Group CTAB. Four hens were immunized, with 
each hen receiving a mixture containing 200Mg of toxoid A and 200Mg of toxoid H. via two 
I.M. injections of 100^1 of CTAB emulsion in the breast area. The second immunization was 
performed .S weeks later, on day ."^S. exactly as described for the first immunization above. 

In order to determine whether hens previously immunized with ( Uiffkilc toxoids 
could tolerate subsequent kiostcr immunizations using native toxins, a single hen from group 
CTAB was immunized for a third time, this time using a mixture of the native loxin A and 
native loxin B described in section (a) above (these toxins were not formaldehyde-treated, and 
were u.sed in their acti\e iorm). This was done in order to increase tiie amount (titer) and 
allinit\ of specific antin)xin antibody produced by the hen over iliat achieved by inmiunizing 
with tt.Noids only. On day 62. 0.1 ml of a toxin mixture was prepared which contained 
2()().iii: of native loxin A and 2()0^ig of native loxin B. This toxin mixture was then 
cmuisilk-d in 0.1 mi of filer Max adjuvant. A single CTAB hen was then immunized with 
the resulting immunogcn emulsion. N ia two I.M. injections of I00,ul each, into the breast area. 
This hen was marked with a wing band, and observed for adverse effects for a period of 
appro.ximatcly I week, after whicli lime the hen appeared to be in good health. 

Because the CTAB hen described above tolerated the booster immunization with native 
toxins A and B with no adverse effects, it was decided to boost the remaining hens with 
native toxin as well. On day 70. booster immunizations were performed as follows: Group 
CTA. .A 0.2 ml volume oi" the 4 mg/ml native loxin A solution was emulsified in an ec|ual 
volume of Titer Max adjuvant. Fach of the 4 hens was then immunized with 200ng of native 
loxin A. as described for the toxoid A immunizations above. Group CTB. A 50mI volume 
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of the 4 mg/ml native toxin B solution was emulsified in an equal volume of Titer Max 
adjuvant. The hen was then immunized with 200j.ig of native toxin B. as described for the 
toxoid B immunizations above. Group CTAB. A 0.15 ml volume of the 4 mg/ml native 
toxin A solution was first mixed with a 0.15 ml volume the 4 mg/ml native toxin B solution. 
5 The resulting toxin mixture was then emulsified in 0,3 ml ol' Tiler Max adjuvant. The 3 
remaining hens (the hen with the wing band was not immunized this lime) were then 
immunized with 200|.ig of native toxin A and 200Mg of native toxin B as deseribcd for the 
toxoid A i toxoid B immunizations (CTAB) above. On day 85. all hens received a second 
booster immunization using native toxins, done exactly as described ibr the first boost with 
10 native toxins above. 

All hens tolerated both booster immunizations with native toxins with no adverse 
effecls. As previous lilcralure references describe the use ol' formaldehyde-treated toxoids, 
this is upparenily the first time thai any immunizations have been performed using native (*. 
cli/Jicih' toxins. 

15 

c) Purification Of Antitoxins 

r-ggs were collected from the hen in group (TB 10-12 days following the second 
immunization with toxoid (day 35 immunization described in section (b) above), nnd from the 
hens in groups CTA and CTAB 20-21 days following the second immunization with toxoid. 

20 fo be used as a pre-immune (negative) control, eggs were also collected from unimmunized 
hens from the same flock. Fgg yolk immunoglobulin (IgY) was extracted from the 4 groups 
t^f eggs as described in Example 1 (c). and the llnal IgY pellets were .solubilized in the 
original yolk volume of PBS w ithout thimerosal. Importantly, ihimerosal was excluded 
because it would have been toxic to the CHO cells used in the toxin neutralization assays 

25 described in section (d) below. 

d) Assay Of Toxin Neutralization Activity 

The toxin neutralization activity of the IgY solutions prepared in section (c) above was 
determined using an assay system that was modified from published methods. |Rhrich et al,. 
30 Infect. Immun. 28:1041-1043 (1992): and McCiee ei ai Mierob. Path. l2:333o4l (1992).! 

As additional controls, affinity-purified goat anti-C. tiifficilc toxin A ( lech Lab) and affinity- 
purified goal anli-C. ili/fidlc toxin B ( Tech Lab) were also assayed for toxin neutralization 
activity. 
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The IgY solutions and goal antibodies were serially diluted using F 12 medium 
(GIBCO) which was supplemented with 2% FCS (GIBCOHthis solution will be referred to as 
"medium" Tor the remainder of this Example). The resulting antibody solutions were then 
mixed with a standardized concentration of either native (*. tliOtcik toxin A (Tech Lab), or 
native C. i/i/fici/c toxin B (Tech Lab), at the concentrations indicated below. Following 
incubation ai 37°C for 60 min.. lOOpI volumes of the toxin »• antibody mixtures were added 
to the wells t>f 96-wcil microtitcr plates (Falcon Microtest ill) which contained 2.5 x |0' 
Chinese Hamster Ovary (CHO) cells per well (the C1IO cells were plated on the previous day 
to allow them to adhere to the plate wells). The final concentration of toxin, or dilution of 
antibody indicated below refers to the final test concentration of each reagent present in the 
respeciixe microliter plate wells. Toxin reference wells were prepared which contained CHO 
cells and toxin A or toxin B at the same concentration used for the toxin plus antibody 
mixtures (these wells contained no antibody). Separate control wells were also prepared 
which contained CIIO cells and medium only. The assa> plates were then incubated lor 18- 
24 hrs. in a 37°C. humidified. 5% CO, incubator. On the followinu day, the remaining 
adherent (viable) cells in the plate wells were stained using 0.2% cry.sial violet (Mallinckrodt) 
dissolved in 2% ethanol. lor 10 min. Faccss stain was then removed by rinsing with water, 
and the stained cells were solubilized by adding 100^1 of 1% SDS (dis.solved in water) to 
each well. I'he absorbance ol'each well was then measured at 570 nm, and the percent 
c\ iotoxicit\ til cach lest .sample or mixture was calculated using the following formula: 

% CHO Cell Cytotoxiciry = |1 - (il^£Li£^?V/l)j x 100 

Abs, Control 

I Unlike previous reports which quanlilatc results visually by counting cell rounding by 
microscopy, this txample utilized spectrophotometric methods to quantitaie the ( Jifficilc 
toxin bioassay. In order to determine the toxin A neutralizing activity of the CTA, CTAB. 
and pre-immune IgY preparations, as well as the affiniiy-purilled goat antitoxin A control, 
dilutions of these antibodies were reacted against a 0.i|.ig/ml concentration i)f native toxin A 
(this is the approx. 50% cytotoxic dose of toxin A in this assay system), fhe results are 
shown in l igure .1. 

Complete neutralization of toxin A occurred with the CTA IgY (antitoxin A. above) at 
dilutions of 1:80 and lower, while significant neutralization occurred out to the 1:320 dilution. 
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The CTAB lyiY (amitoxin A + loxin B. above) demonstrated complete neutralization at the 
1:320-1:160 and lower dilutions, and significant neutralization occurred out to the 1:1280 
dilution. The commercially available afFmily-purincd goat antitoxin A did not completely 
neutralize toxin A at any of the dilutions tested, but demonstrated significant neutralization 
5 out to a dilution of 1:1.280. The preimmune luY did not sIkw any toxin A neutralizing 

activity at any of the concentrations tested. These results demonstrate that IgY purified from 
eggs laid by hens immunized with toxin A alone, or simultaneously with loxin A and toxin B. 
is an effective toxin A antitoxin. 

The toxin B neutralizing activity of the CTAB and pre-immune IgY preparations, and 
10 also the affinily-purified goat antitoxin B control was deiermincd by reacting dilutions of 
these antibodies against a concentration of native loxin B of 0,1 ng/ml (approximately the 
50% cytotoxic dose of loxin B in ihc assay system), fhc results arc shown in figure 4. 

Complete neutralization of loxin B occurred with the C l AB IgY (antitoxin A - loxin 
B. above) at the 1:40 and lower dilutions, and significant ncuU'alizaiion occurred out to the 
15 1:320 dilution. The arfinity-purificd goal antitoxin B demonstrated complete neutralization at 
dilutions of 1:640 and lower, and significant neutralization occurred out to a dilution of 
1:2.560. fhe preimmune IgY did not show any loxin B neutralizing activity at any of the 
concentrations tested. Tiicsc results demonstrate that IgY purified from eggs laid by hens 
immunized simultaneously with toxin A and toxin B is an effective toxin B antitoxin. 

In a separate study, the loxin B neutralizing activity of C"I B. C f AB. and pre-immune 
IgY preparations was determined by reacting dilutions oi these antibodies against a native 
toxin B concentration of O.l^g/ml (approximately 100% cytotoxic dose of toxin B in this 
assay system). The results are shown in Figure 5. 

Significant neutralization of toxin B occurred with the CTB IgY (antitoxin B. above) 
25 at dilutions of 1:80 and lower, while the CTAB IgY (antitoxin A < toxin B. above) was found 
to have significant neutralizing activity at dilutions of 1:40 and lower. The preimmune IgY 
did not show any toxin B neutralizing activity at any of the concentrations tested, fhese 
results demonstrate that IgY purified from eggs laid by hens immunized with toxin B alone, 
or simultaneously with toxin A and toxin B* is an effective toxin B antitoxin. 

30 
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EXAMPLE 9 

In vivo Protection Of Golden Syrian Hamsters From 
C. Jiffkilc Disease By Avian Antitoxins Against C. difficih' Toxins A And B 

The most extensively used animal model to study C. diffuUe disease is the hamster. 
ILyerly aL InCect. Immun. 47:349-352 (1992).] Several other animal models for 
antibiotic-induced diarrhea exist, but n.mc mimic the human form of the disease as closely as 
the hamster model. [ R. |-cketN-. "Aninud Models oj AntihionclnJm cJ ( olin,, " in O. Zak and 
M. Sande (eds.). Experimental Models in Aniimicmhial Chemofherupy. Vol. 2. pp.6 1-72. 
(1986). I In this model, the animals arc first predisposed to the disease by the oral 
administration of an antibiotic, such as clindamycin, which alters the population of normally- 
occurring ga.strointestinal llora (Fekety. at 61-72). Followinij the oral challenge of these 
animals with \ iable C. difficile organisms, the hamsters develop cecitis. and hemorrhage, 
ulceration, and inllammation are e\ ident in the intestinal mucosa. (I.yerh el a/.. Infect. 
15 Immun. 47:.149-352 ( 1985).| The animals become lethargic, develop severe diarrhea, and a 
high percentage of them die iVom the disea.se. |Lyerly cl a/ . Infect, Immun. 47:349-352 
(1985). I This model is therefore ideally .suited for the evaluation of therapeutic agents 
designed for the treatment or prophyla.xi.s olC. difficile disease. 

The ability of the avian C. difficile antitoxins, described in Uxample 1 above, to 
20 protect hani.ster.s iVoni ( •. Jifficile disease was evaluated using the Golden Syrian hamster 

model of C. Jifficile infection. I hc fixample involved (a) preparation of the avian difficile 
aniitoxin.s. (b) i„ vivo protection of hamsters from C. difficile disea.sc by tieatment with avian 
antitoxins, and (c) long-temi survival of treated hamsters. 

-5 a) Preparation Of The Avian C (Nf/Jcile Antitoxins 

liggs were collected from hens in groups CTA and CTAB described in CxampIc I (b) 
above. To be used as a pre-immunc (negative) control, eggs wvrc also purcha.sed from a local 
supermarket, iigg yolk immunoglobulin (IgY) was extracted from the 3 groups of eggs as 
described in lixample 1 (c). and the final IgY pellets were solubilized in one fourth the 

30 original yoJk volume of finsureg nutritional formula. 
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b) In vivo Protection Of Hamsters Against C difficile Disease By 
Treatment With Avian Antitoxins 

The avian C. difficile aniiloxins prepared in section (a) above were evaluated for their 
ability to protect hamsters from difficile disease using an animal model system which was 
modi lied from published procedures. jFekcty, at 61-72: Borriello el uL, J. Med. Microbiol.. 
24:53-64 (1^87): Kim el uL. Infect. Immun.. 55:2984-2992 (1987): Borriello el al.. J. Med. 
Microbiol.. 25:19M96 (1988): Oclmee and Avesani. J. Med. Microbiol., 33:85-90 (1990): 
and l-ycily a uL. infect, immun.. 59:2215-2218 (1991).) l-or the study, three separate 
experimenial groups were used. WMih each group consisting of 7 female ( rulden Syrian 
hamsters (Charles River), approximately 10 weeks t)ld and weighing approximately 100 gms, 
each. The three groups were designated "CTA." '^d'AB" and "Pre-immunc.*' These 
designations corresponded to the antitoxin preparations with which the animals in each group 
were treated, llach animal was housed in an individual cage, nnd was oflercd ibod and water 
Ltd lihitiim through the entire length of the study. On day I. each animal was orally 
administered LO ml of one of the three antitoxin preparations (prepared in section (a) above) 
at the following limcpoinis: 0 hrs,. 4 hrs.. and 8 hrs. On day 2, the day 1 ireatmeni was 
repealed. On day 3. at the 0 hr. limepoint. each animal was again administered antitoxin, as 
described abo\ e. .At 1 hr.. each animal was orally administered 3.0 mg of clindamycin-l ICl 
(Sigma) in 1 ml of water. This treatment predisposed the animals to infection with C 
difficile. As a control for possible endogenous i \ difficile colonization, an additional animal 
from the same shipment (untreated) was also administered 3.0 mg of clindamycin-HCI in the 
same manner. I his clindamycin control animal was lelt untreated (and uniniected) for the 
remainder of the .study. Ai the 4 hr. and 8 hr. timepoints. the animals were administered 
antitoxin as described above. On day 4. at the 0 hr. timepoint. each animal was again 
administered antitoxin as described above. At 1 hr.. each animal was orally challenged with 1 
ml of C. difficile inoculum, which contained approx. 100 ( '. difficile strain 43596 organisms 
in sterile saline. difficile strain 43596. which is a serogroup C strain, was chosen becau.se 
it is representative of one of the most frequently-occurring serogroups isolated IVom patients 
with antibiotic-associated pseudomembranous colitis. [Delmce ef ai, J. Clin. Microbiol.. 
28:2210-2214 (I985).| In addition, this strain has been previously demon.straled to be virulent 
in the hamster model of infection. |Delmee and Avesani. .1. Med. Microbiol.. 33:85-90 
( 1990). I At the 4 hr. and 8 hr. timepoints. the animals were administered antitoxin as 
described above. On days 5 through 13. the animals were administered antitoxin 3x per day 
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as described for day I above, and observed for the orisei of diarrhea and deatin. On the 
morning of day 14. the final results of the study were tabulated. These results are siiown in 
Table 13. 

Representative animals from those tliat died in the Pre-lmmune and (TA groups v/crc 
nccropsicd. Viable C Jijjrici/e organisms were cultured from the ceca oftlie.se animals, and 
the gro.ss pathology of the gastrointe.stinal tracts of these animals was consi.sicnt with that 
expected for r. Jifficilc disease (intlamed. distended, hemorrhagic cecum, tilled with watery 
diarrhca-likc material). In addition, the clindamycin comrol animal remained healthy 
ihroughoui the entire study period, therefore indicating that the hamstcns used in the .study had 
not prcN iouslN been colonized with endogenous C. Jifficilc organisms prior to the start of the 
study. |-ollowing the final antitoxin ireaimcm on day 13. a single surviving animal from the 
( T.\ group, and al.so from the C l AB group, was sacrificed and necropsied. No pathology 
was nt)tcd in either animal. 

TABLE 13 



Treat men I Group 


No. Animals Survivint; 


No. Animals Dead 


Pre- Immune 






VTA (Anitloxin A only) 


5 




CTAB (Ancitoxin A - Antiioxin I)) 


7 


0 



I rcatment of hamsters with orally-administered toxin A and toxin B antitoxin (group 
C I AU) successfully protected 7 out of 7 (100%) of the animals from C. ilif/uHc disease. 
I realmcni of hamsters with orally-administered toxin A antitoxin (group CTA) proiccicd 5 
out of 7 (71%) of the.se animals from C. iliffidle disease. Treatment using pre-immunc lyY 
was not protective against C. diffkik' di.sease. as only I out of 7 (14%) of these animals 
survived. These results demonstrate that the avian toxin A aniitoxin and tlic avian toxin A ^ 
toxin B antitoxin ctTcctively protected the hani.stcrs from ( •. ilifficilt; di.sca.sc. I hcsc results 
also suygcst that although the neutralization of toxin A alone confers some degree of 
protection against C. difficik disease, in order to achieve maximal protection, simultaneous 
antitoxin .'\ and antitoxin B activitv is necessarv. 



c) Long-Term .Sur\ival Of Treated Hamsters 

It has been previously reported in the literature that hamsters treated with orally- 
admini.stered bovine amitoxin IgCt concentrate are protected from C. Jiffidlc di.sea.se a.s long 
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as the ireatment is continued, bui when the treatment is slopped, the animals develop diarrhea 
and subsequently die within 72 hrs. [Lyeriy ei al.. Infect. Immiin.. 59(6):22I5-2218 (1 99 1), J 

In order to determine whether treatment of r. difficile disease using avian antitoxins 
promotes long-term survival following the discontinuation of treatment, the 4 surviving 
5 animals in group CTA. and the 6 surviving animals in group CTAB were observed for a 

period of 1 1 days (264 hrs.) following the discontinuation of antitoxin treatment described in 
section (b) above. All hamsters remained healthy through the entire post-treatment period. 
This result demonstrates that not only does treatment with a\ ian antitoxin protect against the 
onsel of diffivHc disease (/.c. it is effective as a prophylactic), it also promotes long-term 
10 .survixal beyond the ireatment period, and thus provides a lasting cure. 

EXAMPLE 10 

In vivo freaimeni 0\' Lsiablishcd (*. di/fici/c Infcclion in Golden 
Syrian Hamsters With Avian Antitoxins Against dij/icilc Toxins A And R 

riic ability of the avian ( diffici/c aniiloxins. described in llxaniple 8 above, lo treat 
an established ( difficiie infection was evaluated using the (ioldcn Syrian hamster model. 
1'he lixampic involved (a) preparation of ihe avian C. difficile aniiloxins. (b) in vivo treatment 
of hamsters with established ( difficile infection, and (c) histologic evaluation of cecal tissue. 

a) Preparation Of The Avian C\ difficiie Antitoxins 

r.ggs were collected from hens in group CI'AB described in l-xample K (b) above, 
which were immunized with (*. difficile toxoids and native toxins A and B. tiggs purcha.sed 
from a lt>cal supermarket were used as a pre-immune (negaiivc) control, ligg yolk 
immunoglobulin (IgY) was extracted from the 2 groups of eggs as described in Example I 
(e). and the fmal IgY pellets were solubilized in one-fourth the original yolk N'olume of 
Ensure-tt' nutritional formula. 

b) lit vivo Treatment Of Hamsters With Established C. difficile 
30 Inrcction 

The avian difficile antitoxins prepared in section (a) above were evaluated for ihe 
ability to treat established difficUc infection in hamsters using an animal model system 
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which was modified from the procedure which was described for the hamster protection study 
in Example 8(b) abtivc. 

For the study. Ibur separate experimental groups were used, with each group consisting 
or 7 lemale Golden Syrian hamsters (Charles River), approx. 10 weeks old. weighing 
approximately 100 gms. each. Each animal was housed separately, and was offered food and 
water uil lihiitim through the entire length of the study. 

On day I of the study, the animals in all four groups were each prcdi.sposed to (*. 
iliM-ik- infection by the oral administration of r>.0 mg of clindamycin-IICl (Sigma) in I ml ol 
water. 

On day 2. each animal in all four groups was orally challenged with I mi of 
C. ilWkilc inoculum, which contained approximately 100 C. dijfidic strain 43506 organisms 
in sterile saline. ( Jilficih strain 43596 was chosen because it is representative of one of the 
most iVcquently-tK-eurring .scrogroups isolated from patients with aniibiotic-associaied 
pseudomembranous colitis. [Delmee el al.. .1. Clin. Microbiol.. 28.2210-2214 (19Q0).| In 
addition, as this was the .same ( <IHficilf strain used in all of the previous I-xamples above, it 
was again u.sed in order to provide e.\pcrimental continuity. 

On day 3 of the .stud\' (24 iirs. post-infection), treatment was started for two of the 
lour groups of animals. Each animal of one group was orally admini.stered 1.0 ml of the 
CTAH IgY preparation (prepared in section (a) above) at the following timepoinis: 0 hrs.. 4 
hrs.. and 8 lirs. I hc animals in this group were designated "CTAB-24." The animals in the 
second group were each oralh' administered 1.0 ml of the pre-immune IgY preparation (al.so 
prepared in .section (a I above) at the same timepoinis as for the C-fAB group, fhese animals 
were designated "l're.24." Nothing was done to the remaining two groups of animals on day 

( )n day 4. 48 hrs. post-infection, the treatment described for day 3 above was repeated 
for the CTAB-24 and Pre-24 groups, and was initiated for the remaining two groups at the 
same imiepoints. The final two groups of animals were designated "CTAU-48" and "l>re-48" 
respectivelv . 

On days 5 through y. the animals in all four groups were administered antitoxin or 
pre-immune IgY. 3x per day. as described for day 4 above. The four experimental groups are 
summarized in Table 14. 
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TABLE 14 

Experimental Treatment Groups 


( iroUD Dcsicinnt inn 


Experimental Ireatmcnt 


CTAf}-24 


iniccicu. ireaimeni w/aniHOxin lUY sinrteii (w i4 nrs. post'intcction. 


l're-24 


Infected, treatment w/pre-immune luY started (fi) 24 hrs. post- infection. 


CTAU-48 


Infected, treatment w/anlitoxin IjsY stalled (w 48 hrs. posi-infection. 


Pre-48 


Infected, treatment w/prc-immune IgY started dp 4X lirs. post- in feci ion. 



All aninials were observed for the onset of diarrhea and death through the conclusion 
of the sludy on the morniny: of day 10. The results of this study arc displayed in Table 15, 

TABLE 15 

Experimental Outcome-Day 10 



f fcainient Group 


No. Animals Surviviniji 


No. Animals Dead 


(■ rAa-24 


6 


1 


Pre-24 


0 


7 


(:TAn-4x 


4 


^1 


Pre-48 


2 


5 



Eighty-six percent of the animals which began receiving irealmeni with antitoxin IgY 
at 24 hrs. post-infection (CTAB-24 above) survived, while 37% of the animals treated with 
antitoxin IgY slariing 48 hrs. posl-infeciion (CTAB-48 abtive) survived. In conira.st. none ol" 
the anitnals receiv ing prc-immune IgY .starling 24 hrs. post- infection (Prc-24 above) survived, 
and only 29% of the animals which began receiving ireatmcnt with prc-immune IgY at 48 
hrs. post-infection (Pre-48 above) survived ihrough the conclusiiMi of the sludy. I hcse results 
demonstrate that avian antitoxins raised against difficUc loxins A and D are capable of 
successfully Healing cslablished C. difficile infections in vivo, 

c). Histologic Evaluation Of Cecal Tissue 

In order to further evaluate the ability of ihe IgY preparations tested in this study U» 
treat esiablisheij (' cliffivHc infection, histologic evaluations were performed on cecal li.ssue 
specimens obtained from represeniaiive animals from the study described in section (b) abiive. 

Immediately following death, cecal lissuc specimens were removed from animals 
which died in the Pre-24 and Pre-48 groups. Following the completion of the study, a 
representative surviving animal was sacrificed and cecal tissue specimens were removed from 
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the CTAB-24 and CTAB-48 groups. A single untreated animal from the same shipmcnl as 
those used in the study was also sacrificed and a cecal tissue specimen vvas removed as a 
normal control. All tissue specimens were fixed overnight at 4'C in 10% buttered formalin. 
The fixed tissues were paraffin-embedded, sectioned, and mounted on glass microscope slides. 
The ti.ssue sections were then stained using hematoxylin and eo.sin (II and H stain), and were 
examined by light microscopy. 

I 'pon examination, the tissues obtained from the CTAB-24 and CTAB-48 animals 
showed no pathology, and were indistinguishable from the normal control. This observation 
provides further evidence for the ability of avian antitoxins raised against C. difrwih toxins A 
and B to effectively treat established C". dmcik' infection, and to prevent the pathologic 
consequences which normally occur as a result of C. diffkik- disease. 

In contrast, characteristic .substantial mucosal damage and destruction wns observed in 
the tissues of the animals irom the l'rc-24 and I>re-48 groups which died from ( •, diffkilc 
disea.se. Nomial ti.ssue architecture was obliterated in these uvo preparations, as most of the 
mucosal la>er was ob.served to have sloughed away, and there were numerous large 
hemorrhagic areas containing massive numbers of erythrocytes. 

examplb: 1 1 

Cloning And E.xpression Of C. diflkih To.xin A I ragments 

The toxin A gene has been cloned and .sequenced, and shown to encode a protein of 
predicted MW of m kd. [Dove et a/., infect. Immun.. 58;4K()-488 Given the 

cxpen.sc und difficuliy of isolating nati%e toxin A protein, it would be advantageous to use 
smiple and inexpensive procaryolic expression sy.stems to produce and purify high levels of 
recombinant toxin A protein for immuni2ation purposes. Ideally, the isolated recombinant 
protein would be soluble in order to preserve native antigenicity, since .solubilized inclu.sion 
bi>d\ proteins often do not lold into native conformations. To allow ease of purification, the 
recombinant protein .should be expres.scd to levels greater than I mg/liter of E. cnli culture 

To determine u hether high levels of recombinant toxin A protein can be produced in 
E. coli. fragments ol the toxin A gene were cloned into various prokaryotic expression 
N cctors. and assessed liir the ability to express recombinant toxin A protein In £. vali. Three 
prokaryotic expression systems were utilized. These systems were chosen because they drive 
expres.sion of either fusion (pMALc and pGGX2T) or native (pl£T2.3a-c) protein to high levels 
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in E. volL and allow affinity purification of the expressed protein on a ligand containing 
column. Fusion proteins expressed from pGEX vectors bind glutathione agarose beads, and 
are eluted with reduced glutathione. pMAL fusion proteins bind amylose resin, and are eluted 
with maltose. A poly-histidine lag is present at either the N-icrminal (pRTl6b) or C-icrminal 
(pET23a-c) end of pET fusion proteins. This sequence specifically binds Ni," chelate columns, 
and is eluted wiih imidazole salts. Lxicnsive descriptions of these vectors arc available 
I Williams I'l aL (1995) DNA Cloning 2: Expression Syswms. Glover and Hamcs. cds. IRL 
Press. Oxford, pp. I5-58J. and will not be discussed in detail here. The IZxample involved (a) 
cloning of the toxin A gene, (b) expression of large fragments of toxin A in various 
prokaryotic expression systems, (c) identification of smaller toxin A gene fragments that 
express efficiently in coli. (d) purification of recombinant toxin A protein by affinity 
chromatography, and (e) demonstration of functional activity of a recombinant fragment of the 
toxin A gene. 

a) Cloning Of The Toxin A Gene 

A restriction map of the toxin A gene is shown in Figure (i. fhe encoded protein 
contains a carboxy terminal ligand binding region, containing multiple repeats of a 
carbohydrate binding domain. |von L'ichel-Streiber and Saucrborn, Ciene 96:107-1 13 ( 19W).] 
The toxin A gene was cloned in three pieces, by using either the polymerase chain reaction 
(PCR) to amplify specific regions, (regions 1 and 2. Tigure 6) or by screening a constructed 
genomic library for a specific toxin A gene fragment (region 3. figure 6). The sequences of 
the utilized PCR primers are PI: 5* (iCiAAATT TACiCTCiCACiCA rC TCiAC 3' {SIX; ID 
N().:h: P2: TCTAGCAAATTCGCTTGT GTTGAA 3' (SHQ ID N().:2); P3; 5* 
CTCGCA fATAGCATTAGACC 3* (SEQ ID NO.:3): and P4: 5* 
CTATCTAGGCCTAAAGTAT 3' (SEQ ID N0.:4). These regions were cloned into 
prokaryotic expression vectors that express either fusion (pMAI.c and pGEX2'r) or native 
(pET23a-c) protein to high levels in E. colL and allow affinity purification of the expressed 
protein on a ligand containing column. 

Clostridium JifficHc VPI strain 10463 was obtained from the A'I CC (ATCC ^43255) 
and grown under anaerobic conditions in brain-heart infusion medium (BBL). High 
molecular-weight C. clifficHe DNA was isolated essentially as described by Wren and 
l abaqchali (1987) .1. Clin. Microbiol., 25:2402, except proteinase K and sodium dodecyl 
sulfate (SDS) was used to disrupt the bacteria, and celyltrimethylammonium bromide 



. 74 - 



wo 98/08540 



PCTAJS97/15394 



precipilaiion [as described in Ausubel ei aL Current Protocols in Molecular Biology 
was used to remove carbohydrates from the cleared lysaic. The integrity and yield of genomic 
DNA was assessed by comparison with a serial dilution of uncut lambda DNA after 
electrophoresis on an agarose gel. 

Fragments I and 2 were cloned by PCR. utilizing a proofreading thermostable DNA 
polymerase (native pfu polymerase; Stratagene). The high fidelity of this polymerase reduces 
the mutation problems associated with amplification by error prone polymerases (c'.^,, Taq 
polymerase). PCR amplification was performed using the indicated PCR primers (Figure 6) 
in 50 Ml reactions containing 10 mM Tris-HCI(8.3), 50 mM KCI. 1.5 mM MgCl. 200 
each dNTP. 0.2 \xM each primer, and 50 ng C Jifficile genomic DNA. Rcaciions were 
overlaid with 100 m1 mineral oil. heated lo 94*^C for 4 min. 0.5 y\\ native /;/// polymerase 
(Stratagene) added, and the reaction cycled 30x al 94°C for I min. 50°C for I min. IT'C for 
4 min. followed by 10 min at 72°C. Duplicate rcaciions were pooled, chloroform extracted, 
and ethanol precipiiaied. After washing in 70% eihanol. the pellets were resuspended in 50 \i\ 
TF buffer (10 mM Tris-HCL. 1 mM EDTA pH 8.0]. Aliquois of IO|.il each were restriction 
diiicsied with cither £cv;Rl///mc-ll (fragment I) or EcoKMP.siX (fragment 2), and the 
appropriate resiriciion fragments were gel purified using the Prep-A-Gene kit (BioRad). and 
liuatcd to cither /:a;Rl/.SV?;^/Nrestricted p(]EX2T (Pharmacia) sector (fragment 1). or the 
ELo\\\fPst\ pMAIc (New England Bioiabs) vector (fragment 2). Both clones are predicted to 
produce in-frame fusions with either the glulathione-S-transferase protein (pGRX vector) or 
the maltose binding protein (pMAL vector). Recombinant clones were isolated, and 
confirmed by restriciif>n digestion, using standard recombinant molecular biology ttH:hniques. 
I-Sambrook a al.. Molecular Cloning, A Laboratory Manual (1989). and designated pGA30- 
660 and pMA660-l 100. respectively {.sec Figure 6 for description of the clone designations).) 

I'ragment .1 was cloned from a genomic library of size selected Pst\ digested 
( \ difficile genomic DNA. using standard molecular biology techniques (Sambrook et al.). 
Given that the fragment 3 internal /^^7l site is protected from cleavage in C\ clifficilc genomic 
DNA (Price et al.. Curr. Microbiol,. 16:55-60 (1987)]. a 4.7 kb fragment from Pstl restricted 
( Jifficile genomic DNA was gel purified, and ligated to P.stl restricted, phosphatase treated 
plJC9 DNA. The resulting genomic library was screened with a oligonucleotide primer 
specific to fragment 3. and multiple independent clones werc isolated. The presence of 
fragment 3 in several of these clones was confirmed by restriction digestion, and a clone of 
the indicated orientation (Figure 6) was restricted with BamUUHincfllL the released fragment 
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purified by gel electrophoresis, and ligated into similarly rcslricied pET23c expression vector 
DNA (Novagen). Recombinant clones were isolated, and confimied by restriction digestion. 
This construct is predicted to create both a predicted in frame fusion with the pET protein 
leader sequence, as well as a predicted C-terminal poly-histidinc aftinily lau. and is designated 
pPAl 100-2680 (see l-igure 6 for the clone designation). 

h) Kxprcssion Of Large Fragments Of Toxin A In £. coli 
Protein expression from the three expression constructs made in (a) was induced, and 
analyzed by Western blot analysis with an affinity purified, goal polyclonal antiserum directed 
against the toxin A toxoid (Tech Lab). The procedures utilized for protein induction, SDS- 
I'AGli. and Western blot analysis are described in detail in Williams at {\W5), supra. In 
brief. 5 ml 2X YT (16 g iryptone. 10 g yeast extract. 5 g NaCI per liter, pi I 7.5 + 100 ^ig/ml 
ampicillin were added to cultures of bacteria (BL21 for pMAl and pCH-X plasmids. and 
BL2l([)h.l)LysS for pET plasmids) containing the appropriate recombinant clone which were 
induced to express recombinant protein by addition ol* IPT(i to 1 mM. Cultures were grown 
ai 3TL\ and induced when the cell density reached 0.5 ()D,,j,,. Induced protein was allowed 
to accumulate for two hrs after induction. Protein samples were prepared by pelleting I ml 
aliquois of bacteria b\ cenlrifugalion (1 min in a microfuge), and resuspension of the pelleted 
bacteria in 150 ^il of 2x SDS-PAGE sample buffer [Williams c/ oL (IW5). supra]. The 
samples were heated to ^S^'C for 5 min. the cooled and 5 or 10 ^il aliquots loaded on 7.5% 
SnS-PAC}l: gels. liioRad high molecular weight protein markers were also loaded, to allow 
estimation ol' the MW of ideniiiled fusion proteins. After electrophoresis, protein was 
delected cither generally by .staining gels with Coomassie blue, m* specincally. by blotting to 
nitrocellulose for Western blot detection of specific imniunoreactive protein. Western blots, 
(performed as described in Example 3) which detect toxin A reactive protein in cell lysiUes of 
induced protein from the three expression constructs are shown in Figure 7. In this figure, 
lanes 1-3 contain cell lysates prepared from E, coli strains containing pPA 1100-2860 in 
BI2l(l>Ii3)lysE cells: lanes 4-6 contain cell lysates prepared from E. coli strains containing 
pPAl 100-2860 in BI2l(Dl:3)lysS cells: lanes 7-9 contain cell lysates prepared from E. coli 
strains containing pMA30-660: lanes 10-12 contain cell lysates prepared from coli strains 
containing pMA660-l 100. The lanes were probed with an affinity purified goat antitoxin A 
polyclonal antibody ( Tech I.ab). Control lysates from uninduced cells (lanes I. 7. and 10) 
contain \ ery little imniunoreactive material compared to the induced samples in the remaining 
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lanes. The highest molecular weight band observed for each clone is consistent with the 
predicted size of the full length fusion protein. 

liach construct directs expression of high molecular weight (HMW) protein that is 
reactive with the toxin A antibody. The size of the largest immunoreactive bands from each 
sample is consistent with predictions of the estimated MW of the intact fusion proteins. This 
demonstrates that the three fusions are in-frame, and that none of the clones contain cloning 
artifacts that disrupt the integrity of the encoded fusion protein. However, the Western blot 
demonstrates that fusion protein from the two larger constructs (pGA30-6(>0 and pPAl 100- 
2680) are highly degraded. Also, expression levels of toxin A proteins from these two 
constructs are low. since induced protein bands are not visible by Coomassic staining (not 
shown). Several other expression constructs that fuse large sub-regions of the toxin A gene to 
either pMALc or pCT23a-c expression vectors, were constructed and tested for protein 
induction, rhe.se con.structs were made by mixing gel purillcd restriction fragments, derived 
iVom the cxprcwion con.structs shown in Figure 6. with appropriately cleaved expression 
vectors, ligating. and selecting recombinant clones in which the toxin A restriction fragments 
had ligated together and into the expression vector as predicted for in-framc fusions. The 
expressed toxin A interval within these constructs are shown in Figure «. as well as the 
internal restriction sites utilized to make these constructs. 

As used herein, the term "interval" refers to any portion {i.e.. any .segment of the toxin 
which is less than the whole toxin molecule) of a clostridial toxin. In a preferred 
embodiment, "interval" refers to portions of C. t/////t77f toxins such as toxin A or toxin 13. It 
IS also contemplated that these intervals will correspond to epitopes of immunctlogic 
nnportancc. such as antigens or immunogens against which a neutralizing antibody response is 
eltccted. It is not intended that the present invention be limited to the particular intervals or 
sequences described in these Examples. It is also contemplated that sub-portions of intervals 
(t'..i:.. an epitope contained within one interval or which bridges multiple intervals) be used as 
compositions and in the methods of the present invention. 

In all cases. Western blot analysis of each of these constructs with goat antitoxin A 
antibod) (Tech Lab) detected HMW fusion protein of the predicted size (not shown). Yhis 
confirms that the reading frame of each of these clones is not prematurely terminated, and is 
fu.sed in the correct tianie with the fusion partner. However, the Western blot analysis 
revealed that in all cases, the induced protein is highly degraded, and. as assessed by the 
abijence of identifiable induced protein bands by Coomassic Blue staining, are expressed only 
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ai low levels. These resulls suggest thai expression of high levels of Intact loxin A 
recombinant protein is not possible when large regions of the loxin A gene arc expressed in 
£ coli using these expression vectors. 

c) High Level Expression Of Small Toxin A Protein Fusions in 
E. coli 

l:xpericnce indicates that expression difficulties are ofien encountered when large 
(greater than 100 kd) fragments are expressed in /;. volL A number of expression constructs 
containing smaller fragments of the loxin A gene were constructed, to determine if small 
regions ol' the gene can be expressed to high levels without cNlensive protein degradation. A 
summary of these expression constructs arc shown in Figure 9. All were constructed by in- 
frame fusions of convenient loxin A restriction fragments lo either the pMALc or pET23a-c 
\ ectors. Protein preparations I'rom induced cultures of each of these constructs were analyzed 
by both Coomassie Blue staining and Western analysis as in (b) above. In all cases, higher 
levels of intact, full length fusion proteins were observed than with the larger recombinants 
from section (b). 

cl) Purificntion Of Recombinant Toxin A Protein 

Large scale (500 ml) cultures of each recombinant from (c) were grown, induced, and 
soluble and insoluble protein fractions were isolated. The .soluble protein extracts were 
affmily chromatographed lo isolate recombinant lusion protein, as described { Williams et aL 
(IW4). supra]. In brief, extracts containing tagged pl:'r fusions were chromatographed on a 
nickel chelate column, and elutcd using imidazole salts as described by the di,stributor 
(Novagen). Extracts containing soluble pMAL fusion protein were prepared and 
chromatographed in column buffer (10 niM NaPOj, 0.5M NaCI. 10 mM P-mereaptoethanol. 
pH 7.2) over an amylose resin column (New England Biolabs). and cluied with column 
buffer containing 10 mM maltose as described [Williams t7 uL ( 1995). supval When the 
expressed protein was found lo be predominantly insoluble. in.soluble protein extracts were 
prepared by ihc method described in [-xample 17. infrtL fhe resulls are summarized in Table 
16. j-igure 10 shows the sample purifications of recombinant toxin A protein. In this figure, 
lanes I and 2 contain MBP fusion protein purified by afllnily purification of soluble protein. 
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TABLE 16 



Clone 


Protein 
Solubility 


Yield Affinity 
Purified Soluble 
Protein 


% intact Soluble 
Fusion Protein 


Yield Intact 
insoiUDie rusion 
Protein 


PMA30-270 


Soluble 


4 mg/500 mis 


10% 


NA 


l*MA30-300 


Soluble 


4 mg/500 mis 


5-10% 


NA 


pMA300-660 


Insoluble 




NA 


10 mg/500 ml 


pMA66()-l 100 


Soluble 


4.5 mg/500 mis 


50% 


NA 


pMAl 100-1610 


Soluble 


18 mg/500 mis 


10% 


NA 


r\KA A liCm 1 O^n 
piVIA lOIII-l A/ll 


Both 


22 mg/500 mis 


90% 


20 mc/500 ml 


pMA 1450.(870 


insoluble 




NA 


0.2 mg'500 ml 


pPAnOO.|450 


Soluble 


0.1 mg/500 mis 


90% 


NA 


pPAl 100-1870 


Soluble 


0.02 mc/500 mis 


90% 


NA 


pMA 1870-2680 


Both 


12 mg.'500 mis 


80% 


NA 


pPa 18 70-2680 


Insoluble 




NA 


10 mg/500 ml 



'III pP --' pdT2j vector. pM'pMAl.c vecior. A=toxin A. 

^^^^'^ °" ODj,., ^ I mpm\ (exiinciion cocfTicieni of'MBP). 
listimaied by Cuoninssic scaining ofSDS-PAGE gels. 

I.ancs .1 and 4 contain MBP fusion proicin purified by solubilization of insoluble inclusion 
bodies. The purified fusion protein samples are pMA 1 870-2680 (lane I). pMA660-l 100 (lane 
2). pMA300-600 (lane 3) and pMA1450-l870 (lane 4). 

WwT > ie!ds of affinity purified protein were obtained when poly-histidine tagged pET 
Ncciors were u.sed to drive expression (pPAHOO-HSO. pPI 100-1870). However, significant 
protein yields were obtained from pMAI. expression constructs spanning the entire to.xin A 
l?ene. and yields of full-length soluble fusion protein ranged from an estimated 200-400 
Mg/500 ml culture (pMA30-3OO) to greater than 20 mg/500 ml culture (pMA 161 0-1 870). 
Only one interval was expressed to high levels as strictly insoluble protein (pMA300-660). 
Thus, although high level expression was not observed when using large expression constructs 
from tiic toxin A gene, usable levels of recombinant protein spanning the entire toxin A gene 
were obtainable by Isolating induced protein from a series of .smaller pMAI, expre.s.sion 
constructs that span the entire toxin A gene. This is the first demonstration of the feasibility 
of expressing recombinant toxin A protein to high levels in E. voli. 
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c) Hemagglutination Assay Using The Toxin A Recombinant 
Proteins 

riic carboxy terminal end consisting of the repealing units contains the 
hemagglutination activity or binding domain of ( difficile toxin A. To determine whether 
5 the expressed toxin A recombinants retain functional activity, hemagglutination assays were 

performed. Two toxin A rccombinnnt proteins, one containing the binding domain as cither 
soluble afllniiy purified protein (pMA 1 870-2680) or SDS solubilized inclusion body protein 
(pPA 1 870-2680) and soluble protein from one region outside that domain (pMAI 100-1610) 
were tested using n described procedure. (H.C\ Krivan ct. ui. Infect. Inimun.. 53:573 

HI (1986).) t itrated rabbit red blood cells (RRBC)(Cocalico) were washed several limes with 
*fri.s-bul'fer ( O.IM Iris and 50 mM NaCI ) by ccnirifugation at 450 x g for 10 minutes ai 4"* 
C. A 1% RRBC suspension was made from the packed cells and resuspcndcd in I ris-buffer. 
Dilulions of the rccombinanl proteins and native toxin A ( Tech Labs) were made in the fris- 
buffcr and added in duplicate to a round-boltomed 06-well niicroiiler plate in a final volume 

1^ i»f 100 Ml. To each well. 50 |al of the 1% RRBC suspension was added, mixed by gentle 
lapping, and incubated ai 4°C' for 3-4 hours. Significant hemagglutination occurred only in 
the recombinant proteins containing the binding domain (pMA 1X70-2680) and native loxin 
A. The recombinant protein outside the binding domain (pMA 1 100-1610) displayed no 
hemagylulinalion activity. Using equivalent protein conccniraiions. the hemagglutination tiler 

20 for loxin A was 1:256. while tilers for the soluble and insoluble recombinant proteins ol the 
binding domain were 1:256 and about 1:5000. Clearly, the recombinant proteins tested 
rciaincd functional activity and were able to bind RRBC's. 

EXAMPLE 12 

25 l-unciional Activity Of IgY Reactive Against Toxin A Recombinants 

fhe expression oC recombinant toxin A protein as multiple fragments in Exoli has 
demonstrated the feasibility of generating toxin A antigen through use of recombinant 
methodologies (Kxample 1 1). The isolation ofthc.se recombinant proteins allows the 
30 immunoreactivity of each individual subregion of the toxin A protein to be determined (/.e.. 
in a antibody pool directed against the native toxin A protein). This identifies the regions {if 
any) for which little or no antibody response is elicited when the whole protein is used as a 
immunogen. Antibodies directed against specific fragmenls of ihe loxin A protein can be 
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purified by affinity ciiromatography against recombinant toxin A protein, and tested for 
neutralization ability. This identifies any toxin A subregions tliat arc essential for producing 
neutralizing antibodies. Comparison with the levels of immune response directed against 
these intervals when native toxin is used as an immunogen predicts whether potentially higher 
liters of neutralizing amibtnlics can be produced by using recombinant protein directed against 
a individual region, rather than the entire protein. Finally, .since it is unknown whether 
antibodies reactive to the recombinant toxin A proteins produced in F.xample 1 1 neutralize 
loxin A as effectively as antibodies raised against native toxin A (Hxamples 9 and 10). the 
protective ability of a pool of amibodies alllnity purified against recombinant toxin A 
fragments was as.ses.sed for its ability to neutralize loxin A. 

This Rxamplc in^■olvcd (a) epitope mapping of the toxin A protein to determine the 
litre of specific antibodies directed against individual subregions of the toxin A protein when 
naiixe loxin A protein is u.scd as an immunogen. (b) aflinity purification of IgY reactive 
against recombinant proteins spanning the toxin A gene, (c) loxin A ncutruli/luon assav.s with 
aninit> purified IgY reactive to recombinant toxin A protein u, identify subregions of ihc 
loxin A protein that induce the pmduction of neutralizing antibodies, and determination of 
whether complete neutralization of toxin A can be elicited with a mixture ol' antibodies 
reactive to recombinant toxin A protein. 

a) Epitope Mapping Of The Toxin A Gene 

I hc afiinity puriticaiion of recombinani toxin A protein specific lo defined intervals of 
ilic t.>xin A protein allows epitope mapping of aniibocU pools directed against native toxin A. 
This has not prev iously been possible, since previous expression of toxin A recombinants has 
been a.s.sessed onlv by Western blot analysis, without knowledge of the expression levels of 
ihe protein [cy.. von Hichel-Streiber ci al. J. Gen. Microbiol.. 135:55-64 ( 1989)). Thus, high 
or low reactivity of recombinant toxin A protein on Western blots may rcfiect protein 
expression level differences, not inimunoreactivit.v differences. Given that the purified 
recombinant protein generated in L.xample 1 1 have been quaniitated. the i.ssuc of relative 
immunoreactiviiv of individual regions of the loxin A protein was precisely addressed. 

l or the purposes of this F-xample. the toxin A protein was subdivided into 6 intervals 
(1-6). numbered from the amino (interval 1) to the carboxyl (inter\-al 6) termini. 

The reci,mbinant proteins corresponding to these intervals were from expression clones 
(.see I-:.xample I Ud) for clone designations) pMA30-300 (interval I). pMA300-660 (interval 
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2). pMA660-l 100 (interval 3), pPAl 100-1450 (interval 4K pMAl450-1870 (interval 5) and 
pMA 1 870-2680 (interval 6). These 6 clones were selected because they span the entire 
protein from amino acids numbered 30 through 2680. and suhdix ide the protein into 6 small 
intervals. Also, the carbohydrate binding repeal interval is contained specifically in one 
5 interval (interval 6). allowing evaluation of the immune response specifically directed against 
this region. Western blots of 7.5% SDS-PAGH gels, loaded and electrophoresed with defined 
quantities of each recombinant protein, were probed with cither goat antitoxin A polyclonal 
antibody ( Tech Lab) or chicken antitoxin A polyclonal antibody |pC TA IgY. lixampic 8(c)]. 
The bh^ts were prepared and developed with alkaline phosphatase as previously described 

10 (Williams ef al. (1095). supraY Al least 00% of all reactivity, in cither goal or chicken 

antibody pools, was found to be directed against the liuand binding domain (inierval 6). The 
remaining immunoreaciivity was directed against all five remaining intervals, and was similar 
in both antibody pools, except that the chicken antibody sh(nved a much lower reactivity 
againsi interval 2 than the goat antibody, 

15 This clearly demonstrates that when native toxin A is used as an immunogen in goals 

or chickens, the bulk of the immune response is directed against the iigand binding domain of" 
ihc protein, with the remaining response distributed throughout the remaining 2/3 of the 
protein. 

20 h) .\fnnity Purincation (>r IgY Reactive Against Recombinant 

Toxin A Protein 

.-MTmity columns, containing recombinant toxin A protein from the 6 defined intervals 
in (a) above, were made and used to (i) afi'miiy purify antibodies reactive to each individual 
inierval from ihe CTA IgY preparation |I-xampIe 8(c)l. and (ii) deplete interval specific 

25 anlibodie.*; from the CTA IgY preparation. Affinity columns were made by coupling 1 ml of 
l*BS-washed Actigel resin (Sterogene) with region specific protein and 1/10 final volume of 
Aid-coupling solution (IM sodium cyanoborohydride). The total region specific protein 
added to each reaction mixture was 2.7 mg linterval 1). 3 mg (intervals 2 and 3). 0.1 mg 
(interval 4). 0.2 mg (interval 5) and 4 mg (interval 6). Protein for intervals 1. 3. and 6 was 

30 affinity purified pMAI fusion protein in column buffer (see lixample 1 1). Interval 4 was 
affinity purified poly-histidine containing pET fusion in PBS: intervals 2 and 5 were from 
inclusion body preparations of insoluble pMAI. fusion protein, dialyzed extensively in PBS. 
Aliquois of the supernatants from the coupling reactions, before and after coupling, were 
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assessed by Coomassie siaining of 7.5% SDS-PAGE gels. Based on protein band intensities, 
in all cases greater than 50% coupling eniciencies were estimated. The resins were poured 
into 5 ml BioRad columns, washed extensively with PBS. and stored at 4''C. 

Aliquots of the CTA IgY polyclonal antibody preparation were depleted for each 
individual region as described below. A 20 ml sample of the CTA IgY preparation IFlxample 
X(c)| uas dialyzed extensively against 3 changes of PBS ( I liter for each dialysis), quantitated 
by iiKsorbance at ODj.„,. and stored at 4»C. Six I ml aliquots of the dialyzed IgY preparation 
were removed, and depleted individually for each of the six intervals. i:ach 1 ml aliquot was 
pas.sed over the appropriate affinity column, and the eluate twice reapplied to the column. 
The eluate was collected, and pooled with a I ml PBS wash. Bound antibody was eluted 
from the column by washing with 5 column volumes of 4 M Ciuanidinc-IICl (in 10 mM I riij- 
IK I. pi I 8.0). The column was reequilibrated in PBS. and the depleted antibody stock 
reapplied as described above. The eluate was collected, pooled with a I ml PBS wash, 
quantitated by absorbance at ()D.,,„. and stored ai 4° C. In this manner. 6 aliquots of the CTA 
IgY preparation were individually depleted for each of the 6 toxin A intervals, by two rounds 
of affmiiy depletion. The specificity of each depleted stock was tested by Western blot 
analysis. Multiple 7.5% SDS-PAGL gels were loaded with protein samples corresponding lo 
all 6 toxin A siibregions. Al'tcr electrophoresis, the gels were blotted, and protein transfer 
confirmed by Ponceau S staining [protocols described in Williams ci ttl. (1W5). stipnt\. After 
blocking the blots I hr at 20°C in PBS+ 0.1% Tween 20 (PB.ST) containing 5% milk (as a 
blocking buffer). 4 ml of eitlicr a 1/500 dilution of the dialyzed CTA IgY preparation in 
blocking buffer, or an cqui\alent amount of tlie six depleted antibody stocks (using 01).,,, to 
standardize amibodv concentration) were added and the blots incubated a further i hr at room 
temperature. The blots were washed and developed with alkaline pho.sphaiase (using a rabbit 
anti-chicken alkaline pliosphate conjugate as a secondary antibody) as previouslv described 
1 Williams el at. (1995). .\7(/w«). In all cases, only the target interval was depleted for antibody 
reactivity, and at least 90% of the reactivity to the target intervals was specifically depleted. 

Region specific antibody pools were isolated by affinity chromatography as described 
below. I on mis of the dialyzed CTA IgY preparation were applied sequentially to each 
affinity column, such that a single 10 ml aliquot was used to isolate region s|iecific antibodies 
specific to each of the six subregions. The columns were sequentially washed with 10 
volumes of PBS. 6 volumes of BBS-Tween. 10 volumes of TB.S. and eluted with 4 ml 
Actisep eiution media (Sterogene). The eluate was dialyzed extensively against .several 
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chanues of PBS. and ilie affinity purified antibody collected and stored at 4**C. The volumes 
of the eluaie increased to greater than 10 mis during dialysis in each ease, due to the high 
viscosity ol* the Actiscp elution media. Aliquois of each sample were 20x concentrated using 
Ccntricon ^0 microconcentrators (Amicon) and stored at 4**C. The specificity of each region 
5 specific antibody pool was tested, relative to the dialyzed CTA IgY preparation, by Western 
blot analysis, exactly as described above, except that 4 ml samples of blocking buffer 
containing 100 |.il region specific antibody (unconcentrated) were used instead of the depleted 
CTA luY preparations. Each affinity purified antibody preparation was specific lo the defined 
interval, except that samples purified against intervals 1-5 als{^ reacted with interval 6. fhis 

10 may be due to non-specific binding to the interval 6 protein, since this protein contains the 
repetitive ligand binding domain which has been shown lo bind antibodies nonspecifically, 
ILycrh af at., Curr. Microbiol.. 19:303-306 ( I989).| 

The reactivity (^f each affinity purified antibody preparation to the corresponding 
proteins was approximately the same as the reactivity of the 1/500 diluted dialyzed CTA IgY 

15 preparation standard. Given that the specific antibody stocks were diluted 1/40. this would 
indicate that the unconceniraled al'finity purified antibody slocks contain l/l 0-1/20 the 
concentration of specific antibodies relative lo the starting CTA IgY preparati<m. 

c) Toxin A Neutralization As.say Using Antibodies Reactive 
-0 Toward Recombinant Toxin A Protein 

fhe Clio toxin neutralization assay [Example 8(d)l was used to assess the ability of 
the depleted or enriched samples generated in (b) above lo neuirali/e the cytoioxiciiy of toxin 
A. The general ability of affinity purified antibodies lo neutralize loxin A was assessed by 
mixing together aliquois of all 6 concentrated stocks of the (> affinity purified samples 

25 generated in (b) above, and testing the ability of this mixture to neutralize a loxin A 

concentration of 0.1 Mg/ml. The results, shown in Figure 1 1. demonstrate almost complete 
neutralization of toxin A using the affinity purified (AP) mix. Stime epitopes within the 
recombinant proteins utilized for affinity purification were probably lost when the proteins 
were denatured before affinity purification |by (iuanidine-IICI treatment in (b) above). Thus. 

.lO the neutralization ability of antibodies directed against recombinant protein is probably 

underestimated using these affinity purified antibody poiils. fhis experiment demonstrates 
that antibodies reactive to recombinant toxin A can neutralize cytotoxicity, suggesting that 
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neutralizing antibodies may be generated by using recombinant toxin A protein as 
immunogen. 

In view of the observation that the recombinant expression clones of the toxin A gene 
divide the protein into 6 subrcgions. the neutralizing ability of antiboilies directed against each 
individual region was assessed. The neutralizing ability of antibodies directed against the 
ligand binding domain of toxin A was determined first. 

In the toxin neutralization experiment shown in Figure 1 1, interval 6 specific 
antibodies (interval 6 contains the ligand binding domain) were depleted from the dinlyzcd 
IM-.G preparation, and the effect on toxin neutralization assayed. Interval 6 antibodies were 
depicted either by utilizing the interval 6 depleted CTA IgY preparation from (b) above ("-6 
air. depleted" in I'igure 11). or by addition of interval 6 protein to the CTA IgY preparation 
(estimated to be a 10 fold molar excess over anti-interval 6 immunoglobulin present in this 
preparaiioni to competitively compote for interval 6 protein ("-6 prot depleted" in Figure II). 
In both instances, removal of interval 6 specific antibodies reduces the neutralization 
efficiency relative to the starting CTA IgY preparation. I his demonstrates that antibodies 
directed against interval 6 contribute to toxin neutralization. Since inter%al 6 corresponds to 
the ligand binding domain of the protein, these results demonstrate that antibodies directed 
against this region in the ?V.G preparation contribute to the neutralization <i| toxin A in this 
a.ssa\ . I hnvevcr. it is significant thai after removal of these antibodies, the \'[IG preparation 
retains significant ability to neutralize toxin A (Figure I! ). Ibis neutralization is probably 
due to the action of antibodies specific to other regions of the toxin A protein, since at lea.st 
«><»% of the ligand binding region reacti\e antibodies were removed in the depleted .sample 
prepared in ( b) above. I his conclusion was supported by comparison of the toxin 
neutralization ot the affinity purified ( AP) mix compared to affinity purilled interval 6 
antibody alone. Although some neutralization ability was observed with AP interval 6 
antibodies alone, the neutralization was significantly less than that observed with the mixture 
of all () .Ar antibody stocks (not shown). 

(iiN'cn that the mix of all six affinity purified .samples almo.st completely neutralized 
the cytotoxicity of toxin A (Figure 11). the relative importance of amibodies directed against 
uixin A intervals 1-5 w ithin the mixture was determined. This was assessed in two ways. 
First, samples containing affinity purified antibodies representing 5 of the 6 inler\als were 
prepared, such, that each individual region was depleted from one sample. Figure 12 
demonstrates a sample neutralization curve, comparing the neutralization ability of airmilv 
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purified amibody mixes without interval 4 (-4) or 5 (-5) specific antibodies, relative to the 
mix of all 6 aniniiy purified antibody stocks (positive control). While the removal of interval 
5 specific antibodies had no etYect on toxin neutralization (or intervals 1-3. not shown), the 
loss of interval 4 specific antibodies significantly reduced toxin neutralization (l-igurc 12). 

Similar results were seen in a second experiment, in which afTmiiy purified antibodies, 
directed against a single region, were added to interval 6 specific antibodies, and the effects 
on toxin neutralization assessed. Only interval 4 specific antibodies significantly enhanced 
neutralization when added to interval 6 specific antibodies (l-igure 170. These results 
demonstrate that antibodies directed against interval 4 (corresponding lo clone pPAl 100-1450 
in I'igure 9) are important for neutralization of cyioloxicity in this assay, T.piiope mapping 
has shown that only low levels of antibodies reactive to this region are generated when native 
toxin A is u.sed as an immunogen (Example 12(a) |. ii is hypoihesi/ed that immunization with 
recombinant protein specific to this inierval will elicit higher liters ol* neutralizing antibodies. 
In summary, tins analysis has identified two critical regions of the toxin A protein against 
which neutralizing antibodies are produced, as assayed hy the CHO neutralization assav. 

EXAMPLi: 13 

Production And F.valuation ()!' Avian Antitoxin 
Against C. Jif/tcilc Recombinant Toxin A Polypeptide 

In lixample 12. we demonstrated neutralization ol' toxin A mediated cytotoxicity by 
alfniiiy purified antibodies reactive to recombinant toxin A protein. To determine whether 
antibodies raised against a recombinant polypeptide fragment oi' ( \ difficile toxin A may be 
elTeciive in treating clostridial diseases, antibodies to recombinant toxin A protein representing 
the binding domain were generated. Two toxin A binding domain recombinant polypeptides, 
expressing the binding domain in cither the pMALc (pMA 1 870-2680) or pET 23(pl»A1870. 
2680) vector, were used as immunogens. The pMAI. protein was affinity purified as a soluble 
product lli.xample 12(d)] and the pET protein was isolated as insoluble inclusion bodies 
(Example I2(d)l and solubilized to an immunologically active protein using a proprietary 
method described in a pending patent application (U.S. Patent Application ,Serial No. 
08/129.027). This Example involves (a) immunization, (b) antitoxin collection, (c) 
determination of antitoxin antibody tiler, (d) anti-recombinant toxin A neutralization of toxin 
A hemagglutination activity /;; vitro, and (e) assay of//? vitro toxin A neutralizing activity. 
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it) Immunization 

l lic soluble and the inclusion body preparations each were used separately to 
immunize hens. Both purified toxin A polypeptides were diluted in PBS and emulsilled with 
approximately equal volumes ofCFA for the initial immunization or IFA tor subsequent 
booster immunizations. On day zero, for each of the recombinant preparations, two egy 
laying white Leghorn hens (obtained from local breeder) were each injected at multiple sites 
(intramuscular and subcuianeou.s) with 1 ml of recombinant adjuvant mixture containing 
approximately 0.5 to 1.5 mgs of recombinant toxin A. Boo.ster immunizations of 1.0 mg 
were given on dayj? 14 and day 28. 

H) Antitoxin Collection 

Fotal yolk immune IgY was extracted as described in the standard PfXi protocol (as in 
i:xample I) and the final IgY pellet was dissolved in sterile PB.S ai ihc original yolk volume. 
This material is designated "immune recombinant IgY" or "immune IgY." 

c) Antitoxin Antibody Titer 

To determine if the recombinant toxin A protein was sufllcicntly immunogenic to raise 
antibodies in hens, the antibody titer of a recombinant toxin A polypeptide was determined by 
l-I.ISA. I'ggs iVom both hens were collected on day 32. the yolks pooled and the antibody 
\\as isolated using PR(J as described. The immune recombinant IgY antibody titer was 
determined lor the soluble reeombinani protein containing the malto.se binding protein fusion 
generated in p-Mal (pMA 1 870-2680). Ninety-six well Falcon Pro-bind plates vxere coated 
overnight at A^C with 100 nl /well of toxin A recombinant at 2.5 pg /^l in PBS containing 
0.05% ihinierosal. Another plate was also coated with maltose binding protein (MBP) at the 
same c()ncentratii)n. to permit comparison of antibody reactivity to the fusion partner. The 
next da>'. the wells were blocked with PB.S containing 1% bovine .serum albumin (BSA) for I 
hour at .37°C. IgY isolated from immune or preimmune eggs was diluted in antibody diluent 
(PB.S containing 1% BSA and 0.05% Twecn-20). and added to the blocked wells and 
incubated lor 1 hour at 37°C . The plates were washed three times with PBS with 0.05% 
7 ween-20. then three times with PBS. Alkaline phosphala.se conjugated rabbit ami-chicken 
IgG (Sigma) diluted 1:1000 in antibody diluent was added to the plate, and incubated for I 
liour at 37°C. Tlie plates were washed as before and substrate was added, fp-nitrophenyl 
phosphate (Sigma) I at I mg/ml in ().05M NaX'O,. pH 9.5 and 10 iVIgCI,. The plates 
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were evaluated quantitatively on a Dynatech MR 300 Micro EPA plate reader at 410 nm 
about 10 minutes after the addition of substrate. 

Based on these ELISA results, high antibody titers were raised in chickens immunized 
with the toxin A recombinant polypeptide. The recombinant appeared to be highly 
immunogenic, as it was able to generate high antibody titers relatively quickly with few 
immunizations. Immune IgY tiler directed specifically to the toxin A portion of the 
recombinant was higher than the immune IgY titer to its fusion partner, the maltose binding 
protein, and significantly higher than the prcimmune luY. HLISA liters (reciprocal of the 
highest dilution of IgY generating a signal) in the prcimmune IgY to the MBP or ihc 
recombinant was <l:30 while the immune IgY liters to MBP and the toxin A recombinant 
were 1:18750 and > 1:93750 respectively. Importantly, the anti-recombinani antibody titers 
generated in the hens against the recombinant polypeptide is much higher, compared to 
antibodies to that region raised using native toxin A. The recombinant antibody titer to 
region 1870-2680 in the CTA antibody preparation is at least live-tbUI lower compared to the 
recombinani generated antibodies ( 1:18750 versus -'1:93750). Thus, it appears a better 
immune response can be generated against a specific recombinant using that recombinani as 
the immunogen compared to the native toxin A. 

This observation is significant, as it shows that because recombinant portions stimulate 
the production of antibodies, it is not necessary to u.se native toxin molecules to produce 
antitoxin preparations. Thus, the problems associated with ihe toxicity of the native toxin are 
avoided and large-scale antitoxin production is facilitated. 

d) Anti-Rccombinant Toxin A Neutralization Of Toxin A 
Hemagglutination Activity In Viiro 

foxin A has hemagglutinating activity besides cytotoxic and enterotoxin properties. 
Specifically, toxin A agglutinates rabbit erythrocytes by binding to a trisaceharide (gal I-3B1- 
4GlcNAe) on the cell surface. fH. Krivan a ai. Infect. Immun., 53:573-581 (1986),] We 
examined whether the anti-recombinant toxin A (immune IgY. antibodies rai.sed against the 
insoluble product expressed in pET) can neutralize the hemagglutination activity of toxin A in 
vitro, fhe hemagglutination assay procedure u.sed was described by II.C. Krivan a ai 
Polyethylene glycol-fractionalcd immune or prcimmune IgY were prc-absorbed with ciiratcd 
rabbit erythrocytes prior to performing the hemagglutination assay hecau.se we have found that 
IgY alone can agglutinate red blood cells. Citrated rabbit red blood cells (RIlBC*s)(Cocalico) 
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were washed twice by centrifugation ai 450 x g with isotonic buffer (0.1 M Tris-HCI. 0.05 M 
NaCI. pH 7.2). RRBC-rcaciive antibodies in the !gY were removed by preparing a 10% 
RRBC suspension (made by adding packed cells to immune or preimmunc luY) and 
incubating the mixture for I hour at 37**C. The RRBCs were then removed by centrifugation. 
Neutralization of the hemagglutination activity of toxin A by antibody was tested in round- 
bottomed %-\vell microliter plates. Twcnty-nve ^1 of toxin A (36 ng /ml) (Tech Lab) in 
isotonic buffer was mixed with an equal volume of dilTcrent dilutions of immune or 
preimmunc IgY in isotonic buffer, and incubated for 15 minutes at room temperature. Then, 
50 Ml ^>f ii 1% RRBC suspension in isotonic bulTer was added and the mixture was incubated 
for } iiours at 4°(\ Positive control wells containing the final concentration of 9 ^g/ml of 
toxin .A after dilution without IgY were also included. Hemagglutination activity was 
a.s.scs.scd N isually. with a diffuse matrix of RRBCs coating the bottom of the well 
representing a posiiive hemagglutination reaction and a tighi button of RRBCs :u the bottom 
of the well representing a negative reaction. I he anii-recombinanl immune kY neutralized 
toxin .'\ hcnuiggkilinaiion activity, giving a neutralization titer of l:X. However, preimmunc 
IgY was unable to neutralize the hemagglutination ability of toxin A. 

c) Assay Of/// Vifro Toxin A Ncutnili£in^ Activity 

The ahilitN- of the anli-rccombinanl toxin A IgY (immune IgY antibodies raised against 
pVlAlX7(}-2C)S(). the soluble recombinant binding domain protein expressed in piVlAL. 
designated as Anli-tox. A-2 in figure 14 . and referred to as recombinant region <>) and pre- 
ininuine prepared as described in Hxample 8(c) above, to neutralize the cytotoxic activity 
of toxin A was assessed //; rifro using the CI 10 cell cytotoxicity assay, and toxin A (Tech 
Lab) at a concentration oro.lMg/ml. as described in Lxampic S(d) above. As additional 
controls, the anti-native toxin A luY (CTA) and pre-immune IgY preparations described in 
lixample 8(c) abo\e were also tested. The results are shown in Figure 14. 

The anii-recombinani toxin A IgY demonstrated only partial neutralization of the 
cytotoxic activity of toxin A. while the pre-inimune IgY did not demonstrate any significant 
neutralizing activity. 
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EXAMPLE 14 

In vivo Neutralization Q[ C. i/i/ficile Foxin A 



The ability of avian aniibodics (IgY) raised auainst rccombinani loxin A bindinu 
5 domain lo neutralize ihc enlerotoxin aciiviiy o\ C\ c/ifficilc loxin A was evaluated in vivn 
iLsinu Golden Syrian hamsters. The Example involved: (a) preparation of the avian anti- 
reeombinant loxin A IgY for oral administration: (b) in vivn protection of hamsters from C. 
ilijficilc loxin A enicroioxicily by treatment of toxin A ui(h avian anti-recombinant loxin A 
liiY: and (c) hisloloyie evaluation of hamster ceea, 

10 

a) Preparation Of The Avian Anti-Uceombinant Toxin A IrY 
For Oral Administration 

Lggs were collected from hens which had been imnuinizcd with the recombinant 
(Ii//ici/c toxin A fraument pMA 1 870-2680 (described in Example 13. above). A second grou 

15 ol'euus purchased at a local supermarket was used as a pre-immune (neijativeJ control. Lgg 
volk imnuinoylobulin (IgY) was extracted by PEG from the two groups of eg^s as described 
in L-xample 8(c). and the final IgY pellets were solubilizcd in one-fourth the oriiiinal yolk 
volume usinti O.IM carbonate buffer (mixture of Nal ICO: and Na.CXV). pH <).5. The basic 
carbonate bulTer was used in order to protect the toxin A from the acidic pi I of the stomach 

20 environment. 



b) //I I'm; I'rotcction Of Hamsters Against C difftcUe Toxin A 
Enterotoxicity By Treatment Of Toxin A With Avian Anti- 
rccombinant Toxin A IgY 

in order lo assess the ability of the avian anti-recombinani loxin A luY. prepared in 
section (a) above to neutralize the in vivo entcrotoxin activity of loxin A. an i7iv> toxin 
neutralization model was developed usiny Golden Syrian hamsters. This model was based on 
published values for the minimum amount of toxin A required to elicit diarrhea (0.08 mj! 
toxin A/K.U body wi.) and death (0.16 my toxin A/Ky body wi.) in hamsters when 
administered orally (Lycrly vi ul. Infect. Immun.. 47:349* ."52 (1985). 

I*or the study, four separate experimenial groups were used, with each group eonsisling 
of 7 female (iolden Syrian hamsiers (Charles River), approx. three and one-half weeks old. 
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vveighiiig approx. 50 gms each. The animals were housed as groups of 3 and 4. and were 
offered food and water uJ lihiium through the entire length of the study. 

Tor each animal, a mixture containing either lO^ig of toxin A (0.2 mg/Kg) or .lOMg of 
to.vin A (0.6 mg/Kg) (C. difficik' toxin A was obtained from l ech Lab and I ml of cither the 
anti-rccombinant toxin A IgY or pre-immune IgY (from section (a) abt>ve) was prepared. 
These mixtures were incubated at 37°C for 60 min. and were tiien administered to the animals 
by the oral route. The animals were then observed lor the onset of diarrhea and death for a 
period o|-24 hrs. following the administration of the toxin AHgY mixtures, at the end of 
which lime, the following results were tabulated and shown in Table 17: 

TABLE 17 



fvxporimenial Group 


Siudy Ouicome nt 24 Hours 


Hcniihy' 


Diarrhea- 


Dead 


10 uti iu.xin A - AnticoNin Auainst Interval 6 


7 


0 


0 


lO Mt; Toxin A - Aniiioxin Auainsi hucrval 6 




n 


0 


l(» lis: Toxin A • Pre-lmmunc Scrum 


0 






30 III: roxin A • Prc-lmmunc 




5 





Animals rcmnined hcaliliy throiiuh ihc cniire 24 hcnir siudv period. 
Animals dcvelopcil diarrhea, bui did noi die. 
Animals developed diarrhea, and subsequently died. 

Preircatnient ol' toxin A at both doses tested, using the anii-recombinant toxin A IgY. 
prevented all overt .symptoms of disea.se in hamsters. Therefore, pretreaiment ofC. Jifficik' 
toxin /\. using the anti-recombinani toxin A IgY. neutralized the in vivo enierotoxin activity 
ofilic toxin A. In contrast, all animals from the two groups which received toxin A which 
liad been pretreated using pre-immune IgY developed disea.se symptoms which ranged from 
diarrhea to death. The diarrhea which developed in the 5 animals which did not die in each 
of the tx\o pre-immune groups, spontaneously resolved by the end of the 24 hr. study period. 



c) Histologic Evaluation Of Hamster Ccca 

In order to further as.se.ss the ability of anti-recombinant toxin A IgY to protect 
hamsters fmm the enteroioxin activity of loxin A. histologic evaluations were performed on 
the ccca of hamsters from the .study described in section (b) above. 

Three groups of animals were saerinced in order to prepare histological specimens. 
The lirst group con.sisted of a single representative animal taken from each of the 4 groups of 
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surviving hamsters at ihe conclusion of the study described in scciion (b) above. These 
animals represented the 24 hr. timepoint of the study. 

The second group consisted of two animals wlilch were not part of the study described 
above, but were separately treated with the same toxin A * pre-immune IgY mixtures as 
5 described for the animals in scciion (b) above. Both of these hamsters developed diarrhea, 
and were sacrificed 8 hrs. alter the time of administration of the toxin A + pre-immune IgY 
mixtures. At the time of sacrifice, both animals were presenting sN mpioms of diarrhea. 
These animals represented the acute phase of the .study. 

The linal group consisted of a single untreated hamster fmm the same shipment ol* 

10 animals as those used for the two previous groups. This animal served as the normal control. 

Samples of cecal li.ssuc were removed from the 7 animals described abo\ c, and were 
fixed overnight at 4%' using 10% buftercd formalin. The fixed tissues were paraffin- 
embedded, sectioned, and mounted on glass microscope slides. The tissue sections were then 
stained using hematoxylin and eosin (II and E slain), and were examined by light microscopv. 

I he tissues obtained from the two 24 hr. animals which received mixtures containing 
either l()|.iu or 3U|.ig of toxin A and anti-rccombinani toxin A IgY were indistinguishable from 
the normal control, both in terms of gross pathology, as well as at the microscopic level. 
These observations provide further evidence for the ability of anti-rccombinani toxin A IgY to 
efl'ectivelv neiiirali/e the in vivo cnieroloxin activity of (' Ji/Jicilc toxin .A. and thus its ability 

20 to prevent acute or lasting toxin .'\-induced pathology. 

In contrast, the tissues from the two 24 hr. animals which received the toxin A + pre- 
immune IgY mixtures demonstrated significant pathology. In both of these groups, the 
nuico.sal layer was observed to be less organized than in the normal control tissue. I he 
cytoplasm of the epithelial cells hud a \acuolated appearance, and gaps were present between 

25 the epithelium and the underlying cell layers. The lamina propria was largel\ absent. 

Intestinal villi and crypts were significantly diminished, and appeared to ha\e been overgrown 
by a planar layer of epithelial cells and fibroblasts, fherefore. although these animals overtly 
appeared to recover from the acute symptoms of toxin A intoxication, lasting pathologic 
alterations to the cecal mucosa had occurred. 

30 The tissues obtained from the two acute animals which received mixtures of toxin A 

and pre-immune IgY demonstrated the must significant pathology. At the gross pathological 
level, both animals were observed to have severely distended ceca which were filled with 
watery, diarrhea-like material, /\t the microscopic level, the animal that was given the 
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mfxture containing lO^g of toxin A and pre-immune IgY was found to have a mucosal layer 
which had a ragged, damaged appearance, and a disorganized, compacted quality. The crypts 
vvcre largely absent, and numerous breaks in the epithelium had occurred. There was also an 
influx orerythr.,cyies into .spaces between the epithelial layer and the underlying tissue. The 
animal uhich had recC-ived the mixture containing 30pg of toxin A and pre-immune IgY 
demonstrated the most severe pathology. The cecal tis.sue of this animal had an appearance 
very similar to that okserNcd in animals which had died from C. c/////V,7c. disJa.s.. Widespread 
destruction of the mucosa was noted, and the epithelial layer had sloughed. Ilemorrhui-ic 
areas containing large numbers of erythrocytes were very prevalem. All semblance of normal 
tis.sue architecture was absent from this .specimen. In terms of the presentation of pathologic 
events, this vivo ham.ster model of toxin A-intoxieation correlates very closelv with the 
pathologic consequences of C. JiJfkUc disease in hamsters. The results presemcd in this 
lixample demon.strate that while anti-recombinani toxin A (interval 6, IgY is capable of onlv 
partially neutralizing the cytotoxic acliv ity of C. c/iffki/c toxin A. .he same antibodv 
effectively neutralizes 100% of the m vivo enierotoxin activity of the toxin. While it is not 
intended that this invention be limited to this mechanism, this may he due to the cviotoxicity 
and entcroti.xicily of C Ji/,idlc Toxin A as two .separate and distinct biological functions. 

EXAMPLE I.S 

In l ivo Neutralization Of C Dilfkih l o.xin A By 
Antibodies Against Recombinant Toxin A Polxpeptidcs 

riie abiliis ol a\ ian antibodies directed against the recombinant C. cliffialc toxin A 
Ihigmeni IS70-2680 (as expres.sed by pMA 1 870-2680: see Hxample 1.1) to neutralize the 
enienxoxic activity of toxin A was demonstrated in Example 14. The ability of avian 
antibodies (IgYs) directed against other recombinant toxin A epitopes to neutralize native 
toxin A viva was next evaluated. This example involved: (a) the preparation of IgYs 
auainsi ivc.imbinant toxin A pol> peptides: (b) in vivo protection of hamsters against toxin A 
b> treatment with anti-recombinant toxin A IgYs and (c) quamilieati^.n of specilic antibody 
concentration in CTA and Interval IgY PEG preparations. 

The nucleotide .sequence of the coding region of the entire toxin A protein is listed in 
SlIO II) N():5. The amino acid sequence of the entire toxin A protein is listed in Sl-Q ID 
N():fi. The amino acid sequence consisting of amino acid residues 1870 through 2680 of 
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loxiii A is listed in SKQ ID N0:7. The amino acid sequence consisling of amino acid 
residues 1870 through 1960 of loxin A is listed in SFQ ID N():8. 

21) Preparation Of IgY\s Against Recombinant l oxin A 
5 Polypeptides 

liggs were collected from Leghorn hens which have been immunized with recombinant 
( * clif/icUc loxin A polypeptide fragments encompassing the entire toxin A protein. The 
polypeptide fragments used as immunogens were: I ) pMA 1870-2680 (Interval 6). 2) pPA 
1 100-1450 (Interval 4). and 3) a mixture of fragments consisting of pMA 30-300 (Interval 1), 

10 pMA 300-660 (Interval 2), pMA 660-1 100 (Interval 3) and pMA 1450-1870 (Interval 5). 
This mixture of immunogens is referred to as Interval 1235. The location of each interval 
within the toxin A molecule is shown in Figure I5A. In Tigure I5A. the following 
abbrc\ iaiions arc used: pP refers to the pirr23 vector (New lingland BioLabs): pM refers u> 
the pMAI.^^'-c vector (New lingland BioLabs): A refers to toxin A: the numbers refer to the 

15 amino acid interval expressed in the clone. (For example, the designation pMAjO-3()0 

indicates iliat the recombinant clone encodes amino acids 30-300 of loxin A and the vector 
used was pMAi.'^'-c). 

I'he recombinant proteins were generated as described in l-xamplc I I. The IgYs were 
extracted and solubili/ed in 0. IM carbonate buffer pM 9.5 for oral administration as described 

20 in l-xample 14(a). I he IgY reactivities against each individual recombinant interval was 
evaluated by liLISA as described in Fixample 13(c). 

h) In yivo Protection Of Hamsters Against Toxin 

A By Treatment With Anti-Recombinant Toxin A Antibodies 

25 1 he ability of antibodies raised against recombinant loxin A polypeptides to provide in 

vivo protection against the enterotoxic activity of toxin A was examined in the hamster model 
system. This assay was performed as described in Example 14(b). Brielly, for each 40-50 
gram female (iolden Syrian hamster (Charles River), 1 ml of IgY 4X (i.e.. resuspended in 1/4 
of the original yolk volume) PI:(i prep against Interval 6, Interval 4 or Interval 1235 was 

30 mixed with 30 |.ig (l.D,.„, oral do.se) of difficile toxin A (Tech l.ab). Preimmune IgY 

mixed with toxin A served as a negative control. Antibodies raised against ( \ difficile toxoid 
A (F'xample 8) mixed with toxin A (CTA) served as a positive control. The mixture was 
incubated for I hour at 37^C then orally administered to lightly etherized hamsters using an 
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I8G fecdiny needle. The animals were then observed for llie onset of diarrhea and deaih for 
a period of approximately 24 hours. The results are shown in Tabic 1 8. 



TABLF. 18 

Siii(l\ Oiiicomc A Tier 24 Honrs 



Trcatmcni ijroiip 


Heallhy' 


Diarrhea" 


Dead' 


Prcinimtmc 


0 


i) 


7 


C'TA 


5 


0 


0 


liiteivnl 6 


(^ 


1 


0 


Inierval \ 


{) 


1 


6 


Imerviil \2j5 


0 


(1 


7 



Animal slums no siun of illness. 

Animal developed diarrhea, but did noi die. 

Animal developed dijirrhca and died. 



l*re-ircLiimcni of loxin A with IgYs against lnicr\'al 6 prevailed diarrhea in 6 ol* 7 
luniisiLTs and complcicly prcvcnlcd death in all 7. in contrast, as with prcininninc ItiY. IliYs 
auainst Interval 4 and Interval 1235 had no effect on ihc onset of diarrhea and death in the 
hiimsters. 



c) Quantification Of Specific Antibody Concentration In CI A 
And interval 6 IrY PK(; Preparations 

To determine the purity of luY PEG preparations, an aliquot of a pMAl 87()-26«() 
{lnicr\iil (>) \iiY PIX] preparation was chroinatographed usinu UPl.C and a K\V-8():> si/Jny 
column (.Shodex). Ihe resulting profile of absorbance at 280 nm is shown in figure 16. The 
single large peak corresponds lo the predicted MW of IgY. Integration of the area under the 
single large peak showed that greater than 95% of the protein eluled from the column was 
present in this single peak. This result demonstrated that the majority (>95%) ol* the material 
absorbing at 280 nm in the PEG preparation corresponds lo IgY. fherefore. absorbance at 
280 nm can be used lo determine ihe loial antibody concentration in PIZG preparations. 

I'o determine the concentration of Inierval 6-speeillc antibodies (expressed as percent 
of loial aniibody) within the CTA and pMA1870.2680 (Interval 6) PFG preparations, defmcd 
quantities of these aniibody preparations were afllnily purilled on a pPA 1 870-2680(1 1) (shown 
schematically in higure I5B) afllnily column and the .specific antibodies were quantified. In 
I-igure I5B the following abbreviations are used: pP refers to ihe pCT2.1 vecior (New England 
liioLabs): pM refers lo the pMAI."^-c vector (New England BioLabs): pG refers to the pGEX 
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vector (Pharmacia); pB refers lo ihe PinPoini'^' Xa vector (Promega): A refers lo loxin A: ihe 
numbers refer lo the amino acid inicrval expressed in the clone. The solid black ovals 
rcprescni the MBP: the hatched ovals represent glulaihione S-iransferase: the hatched circles 
represent the biotin tau: and HUM represents the poly-hisiidine tag. 

An affinity column containing recombinant loxin A repeat protein was made as 
follows. I-our ml of PBS-washcd Aciigel resin (Sterogcne) was coupled with 5-10 mg of 
pPAI«7().2680 inclusion body protein (prepared as described in Example ( 17) and dialyzed 
into PBS| in a 15 ml lube (Falcon) containing l/IO final volume Ald-coupliny solution (I M 
sodium cyanoborohydride). Aliquois of the supernatant from the coupling reactions, before 
and after coupling, were assessed by Coomassie staining of 7,5% SDS-PAGI: gels. Based 
upon protein band intensities, greater than 6 mg ol* recombinant protein was coupled lo the 
resin. The resin was poured into a 10 ml column (BioRad). washed extensively with PBS. 
pre-eluted with 4 M guanidine-fiCI (in M) mM Tris-HCI. pli X.O: ().()()S% ihimerosal) and rc- 
equilibrated with PBS. I he column was stored at 4W 

Aliquois of a pMAI 870.2680 (Interval 6) or a CTA IgY polyclonal antibody 
preparation (PlZCi prep) were aftlnity purified on ihe above alllniiy column as Jollows. fhe 
column was attached lo an liV monitor (ISCO) and washed wiih PBS. I or pMA 1 870-2680 
IgY purilieaiion. a 2X Pl.:(i prep (Jllier sterilized using a 0.4.S p liUer: approximately mg 
total IgY) was applied. The column was washed with PBS until the baseline was re- 
established (the column llow-ihrough was saved), washed with BBSTween to ekue 
nonspeeillcallN- binding antibodies and rc-equilibraied with PBS. IJound antibody was eluled 
from the column in 4 M guanidine-IICI (in 10 mM Tri.s-liCl. pH 8.0: 0.005% thimero.sal). 
fhe entire elulion peak was collected in a 15 ml lube (falcon). I he column uas re- 
equihbrLitcd and the column eluaie was re-ehromalographed as described above, fhe antibody 
preparation was quantilled by I 'V absorbance (the elulion buffer was used lo zero the 
spectrophoiomeier). Fotal puritied antibody was approximately 0 mg and I mg from the tlrst 
and second chromatography passes, respectively. The low yield from the second pass 
indicated that mo.si specific antibodies were removed by the first round of chromatography. 
The estimated percentage of Interval 6 specific antibodies in the pMA 1 870-26X0 Pf.G prep is 
approximaiel\" 2%. 

The percentage of lnter\al 6 specific anlibodics in the CTA PF.G prep was determined 
(utilizing the same column and methodology described above) to be approximately 0.5% of 
lotal IgY. 
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A 4X PEG prep comains approximately 20 mg/ml IgY. Thus in b) above, 
approximately 400 specific antibody in the Interval 6 PEG prep neutralized 30 ^ig toxin A 
in vivo. 



EXAMPLE 16 

h I 'ivo Treaiment Of C*. cli/j/icilc Disease In 
Hamsters By Recombinant Interval 6 Antibodies 

The ability of antibodies directed against recombinant Interval 6 of toxin A lo protect 
hamsters //; viva from C. c/i/fici/c disease was examined. This example involved: (a) 
prophylactic treatment olT. ^/z/Z/d/c disease and (b) therapeutic trcatniLMit olT. Jif/id/c 
disease. 

:i) Prophylactic Treatment Of C. (lif/ici/e Disea.sc 

i his experiment was pcrlbrmcd as described in txample ^(b). Tiiree groups each 
consisting of 7 lemale 100 gram .Syrian hamsters (Charles Rixcr) were prophyiactically treated 
with either prcimmunc luYs. I^Ys against native toxin A and B (CI AB: see lixample X (a) 
and (b)| or ig\s against interval 6. IgYs were prepared as 4X PlXi preparations as described 
in Hxample 

i iic animals were orally do.sed .1 times daily, roughl.v at 4 hour intervals. Ibr 12 days 
with I ml antibody preparations diluted in tnsure-;i<;. Using estimates of specific antibody 
conccniraiion iVom Example 15(c). each dose of the Interval 6 antibody prep contained 
approximaiclx 400 ng of spcciiic antibody. On day 2 each hamster was predisposed to C. 
diflkilc inlcction hy the oral administration of .1.0 mg of Clindamycin-IICI (Sigma) in I ml 
ol water. On day .1 the ham.sters were orally challenged with I ml of C. t/////t7/t' inoculum 
strain ATCC 4.1596 in sterile saline containing approximately 100 organisms. The animals 
were then observed for the onset of diarrhea and subsequent death during the treatment 
period, fhe results arc shown in Table 19. 
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TABLE 19 
Lethality After 12 Days Or Trcnimeni 



Treatment Group 


Number Animals Alive 


Number Animals Dead 


Prcimniunc 


0 


7 


CTAB 


6 


1 


Interval 6 


7 


0 



Ticalinenl of hamsters with orally-administered luYs ayainst Interval 6 siieccssf'ully 
protected 7 out of 7 (100%) ol" the animals from ( ' ili/ftcilv disease. One of the hamsiers in 

10 this group presented with diarrhea which subsequently resolved during the course of 

ireaimenL As shown previously in lixamplc 9. antibodies to native toxin A and toxin B were 
hiyhlN- protective. In this l-.xample. b out of 7 animals survived in the C'I'AB treatment group. 
All of the hamsiers treated with preimmune sera came down with diarrhea and died. The 
surN ivurs in both the C'TAIi and Interval 6 groups remained heahhy throughout a 12 du\ post- 

15 ircaimcnl period. In particular. 6 out of 7 Interval 6-treated hamsters survived at least 2 
weeks after termination of treatment which suggests that these antibodies prov ide a long- 
lasting cure. These results represent the first demonstration that antibodies generated against a 
recombinant region of loxin A can prevent CDAI) when adminislered passivelv to anitnals. 
riiese results also indicate that antibodies raised against Interval alone nia\ be sufficient to 

20 protect animals Ironi C*. cliff/cilc disease when administered prophyluciically. 

Prc\ iously others had raised antibodies against toxin A by acii\el> ininnini/.ing 
hamsters iijiainst a recombinant polypeptide located within the Inierval 6 region jl.yerU. 
D.M.. a ill. ( 19^)0) Curr. Microbiol, 2I:29|. Figure 17 shows sciiemaiically the location of 
the Lyerly. cv al. inira-lnlerval (i recombinant protein (cloned into the pUC vector) in 

25 comparison with the complete Inierval 6 construct (pMAlS70-2680) used herein to generate 
neutralizing antibodies directed against loxin A. In I-igure 17. the solid black oval represents 
the MBP which is fused to the toxin A Interval 6 in pMAl 870-2680. 

The Lyerly. ct al. antibodies (inlra-lnlerval 6) were only able to partially protect 
hamsters against difficile infection in terms of survival (4 out of 8 animals survived) and 

30 furthermore, these antibodies did not prevent diarrhea in any of the animals. Additionally, 
animals treated with the inlra-lntcrval 6 antibodies [I.ycrly. ei ai (I WO), supra] died when 
treatment was removed. 

In contrast, the experiment shown above demonstrates that passive administration of 
anti-Interval ft antibodies prevented diarrhea in 6 out of 7 animals and completely prevented 
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death due to CDAD. Furthermore, as discussed above, passive administration of the anti- 
Interval 6 antibodies provides a ioni: la.sting cure {i.e.. treatment could be withdrawn without 
incident). 

b) I'herapcutic Treatment Of C difficUe Disca.sc: In Vivo 
Treatment Of An Established C. tlifficUe Infection In 
Hamsters With Recombinant Interval 6 Antibodies 

The ability of antibodies against recombinant interval 6 of toxin A to therapeutically 
treat r. ilitlkik- disease was examined. The experiment was performed essentially a.s 
described in t.xample lOfb). Three groups, each containing seven to eight female Oolden 
Syrian hamsters (100 g each: Charles River) were treated with either prcimmunc IgY. IgYs 
against native lo.xin A and toxin B (CTAB) and IgYs against Interval 6. I he antibodies were 
prepared as described above a.s 4X PF.Ci preparations. 

The hamsters were first predisposed to C. infection with a .1 mg di)se of 

C lindam>cin-HC'l (.Sigma) admini.siered orally in I ml of water. Approximate!) 24 hrs later, 
the animals were orally challenged with I ml of C. ilifricik- strain ATCC -4.1596 in sierilc 
saline ciinuiininu approximaich 200 organisms. One day after infection, the presence ol' loxin 
A and M uas determined in tiic Icccs of the hamsters using a commercial immunoassay kit 
(Cytoclone A-B liPA. Cambridge Biotech) to verify establishment of infection, l our 
members of each group were randomly selected and tested. Feces from an uninfected hamster 
was tested as a negative ct)nirol. All infected animals tested po.sitive for the presence of toxin 
accoriling ti) the numufaciurer s procedure. The initiation of treamient then started 
appro.ximalcly 24 hr post-infection. 

I he animals were do.scd dail>> at roughly 4 hr intervals with I ml antibody preparation 
diluted in linsurcK: (Ross Labs). The amount of specilic antibodies given per dose 
(determined b\ aflinity purification) was estimated to be about 400 ^ig of anti-Interval 6 IgY 
(for animals in the Interval 6 group) and 100 ^ig and 70 ng of anti-toxin A (Interval 6- 
specille) and anti-toxin B (Interval .VspcciHc: see Example 19). rcsptxnively. lor the CTAB 
preparation. I lic animals Nvere treated lor 9 days and then observed for an additicmal 4 days 
for the presence of diarrhea and death. The results indicating the number of survivors and the 
number of dead 4 days post-infection are shown in Table 20. 
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TABLE 20 

In vivo Therapeuiic Treatment With Inlerval 6 Antibodies 



Trcainieni Group 


Number Animals Alive 


Number Animals Dead 


Prcimmiine 


4 


3 


CTAB 


8 


0 


Iniervat 6 


8 


0 



Amibodics directed against both Interval 6 and CTAIi succcsstully prevented death 
iVom ( ilif/icilc when ihcrapcuiically administered 24 hr after infection. This result is 

10 significant since many investigators begin therapeutic treaiinent of hamsters with existing 

drugs {c.ii.. vancomycin, phenelfamycins. liacumieins, etc.) 8 hr post-infection [Swanson. c/ 
al. (IWI) Antimicrobial Agents and Chemotherapy 35:1 108 and (1989) J. Anlibiolics 42:94|. 

l orty-iwo percent of hamsters ti*eated with preimnuinc IgY died from C.'DAI). While 
the anli-lnterval 6 antibodies prexcnted death in the treated hamsters, they ciid not eliminate 

I? all symptoms of CDAI) as .1 animals presented with slight diarrhea. In addition, one ( TAB- 
trealcd and one preimmune-trcatcd animal also had diarrhea 14 days posi-inlection. These 
results indicate that anti-Interval 6 antibodies provide an elTeciive means ol' therapy for 
CIMI). 

20 EXAMPLE 17 

Induction Of 'foxin A Neutralizing Antibodies Requires Soluble Interval 6 Protein 

As shown in [Examples 1 1(d) and 15. expression of recombinant proteins In /:. coli 
may result in the production of either soluble or insoluble protein. If insoluble protein is 

25 produced, the recombinant protein is solubilizcd prior to immunization ot animals. To 

determine whether, one or both of the soluble or insoluble recombinant proteins could be used 
to generate neutralizing antibodies to toxin A, the following experiment was performed. This 
example involved a) expression of the toxin A repeats and subfragmcnis of these repeals in 
afli using a variety of expression vectors: b) identification of recombinant toxin A repeats and 

30 sub-regions to vVhich neutralizing antibodies bind: and c) determination of the neutralization 
ability of antibodies raised against soluble and insoluble toxin A repeat immunogen. 
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a) Expression Of The Toxin A Repeats And Subfragments Of 
These Repeats In £1 coli Using A Variety Of Expression 
Vectors 

The Interval 6 immunogcn utilized in Examples 15 ami 16 was the pMAI 870-2680 
protein, in which the toxin A repeats arc expressed as a soluble fusion protein with the MBP 
(clcscrilx-d in Example II). Interestingly, expression of this region (from the .S/,.I site to the 
end of the repeat.s. sec Figure I5B) in three other expression constructs, as cither native 
(PPAI870-2680). poly-llis tagged fpPAI870-2680 (11)1 or biotin-tagged (pBAl 870-2680) 
proteins resulted in eomplctely insoluble protein upon induction of the bacterial host (see 
Figure I5B). The host .strain BI.2I (Novagen) was used for expression of pDA 1 870-2680 and 
host strain BI.2l(Dr:.-?) (Novagen) was used for expression of pPA1870-2680 and pPAI870- 
2680(11). These insoluble proteins accuniulated to hiuli levels in inclusion bodies. Expression 
of recombinant pla.smids in E. cofi host cells grown in 2X YT medium was performed as 
described |William.s. c/ «/. (|995>. siipni]. 

As summarized in Figure 1 5B. expression of fragments of the toxin A repeats (as 
either N-terminal SpcUEam fragments, or C-terminai £a,RI-cnd fragments) also vicldcd 
high levels of insoluble protein using pCiEX (p(iA 1 870-2190). PinPoint'--.Xa (pBAI870-2190 
iind pBA225()-2680) and pliT expression systems (pPA 1870-2 190). The pCiliX and pET 
expression systems are described in Example 1 1. The PinI>oint--Xa expression svstem drives 
ihe expiession of fusion proteins in £. coii. f usion proteins from I>inP„int'^'-Xa v-cctors 
eoniain a bioiin tag at the amino-ierminal end and can be affinity purilled Sofil.ink'^' Soft 
Release av idin resm (Promega) under mild denaturing conditions ^> mM biotin). 

ihe solubility of expressed proteins from the pPG 1 870-2 1 90 and pPA 1 870-2 1 90 
expression eon.struets was determined after induction of recombinant protein expression under 
eonditi(,ns reported to enhance protein solubility [These conditions compri.se growih of the 
host at reduced temperature (.lO^C) and the utilization of high (I mM IPT(]) or low (0.1 mM 
IPTCi) concentrations of inducer [Williams ./ «/. (1995). .upra]. All expres.sed recombinant 
toxin A protein uas in.soluble under these conditions. Thus, expression of these fraumcms of 
the toxin A repeats in pET and pGEX expression vectors results in the production of iasoluble 
recombinant protein even when the host cells are grown at reduced temperature and using 
lower concentrations of the inducer. Although expression of these fragments in pMal vectors 
y.elded alfmity purillable .soluble fusion protein, the protein was either predominantiv 
.".soJuble (PMAI870-2190) or unstable (pMA2250-2650). Attempts to solubili^c expressed 
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protein from ihe pMA 1870-2 190 expression consirucl using reduced temperature or lower 
inducer concentration las described above) did not improve fusion protein solubility. 

Collectively, these results demonstrate that expression of the toxin A repeal region in 
E, L'oli results in the production of insoluble recombinant protein, when expressed as cither 
.5 large (aa 1870-2680) or .small (aa 1870-2190 or aa 2250-2680) fragments, in a variety of 

expression vectors (native or poly-his tagged pET. pGLX or HinPoint^''^- Xa vectors), utilizing 
gnnvih conditions shown to enhance protein solubility. Tlie exception to this rule were 
fusions with the MBI\ which enhanced protein solubility, either partially (pMA 1870-2 190) or 
fully (pMA 1X70.2680). 

10 

h) Idcntirication Of Recombinant Toxin A Repeats And Sub- 
Regions To Which Neutralizing Antibodies Bind 

loxin A repeal regions to which neutralizing cintibodics hind were identified hy 
utilizing recombinant toxin A repeal region proteins expressed as soluble or insoluble proteins 

15 to deplete pnnecti\c antibodies from a polyclonal pool of antibodies against native ( dijficilc 
toxin A. An in viva assay was developed to evaluate proteins for the ability to bind 
neutralizing antibodies. 

f he rational for this assay is as follows. Recombinant proteins were first pre-mixed 
with antibodies against native toxin A (CTA antibody: generated in l:\ample X) and allowed 

20 to react. Subsequently. (* difficile toxin A was added at a concentration lethal to hamsters 
and the mixture was administered to hamsters via IP injection. If the recombinant protein 
contains neutralizing epitopes, the C TA antibodies would lose their ability U) bind toxin A 
resulting in diarrhea and/or death of the hamsters. 

fhe assay was performed as follows, 'f he lethal dose of loxin A when delivered orally 

25 to nine 40 to 50 g (lolden Syrian hamsters (Sasco) was determined to be 10 to 30 jjg. The 

PnCj-purificd CTA antibody preparation was diluted to 0.5X concentration (/.c. the antibodies 
were diluted at twice the original yolk volume) in 0.1 M carbonate buffer. pH 9.5. fhe 
antibodies were diluted in carbonate buffer to protect them from acid degradation in the 
stomach, fhe concentration of 0.5X was used becaase it was found to be the lowest effective 

."lO concentration against toxin A. The concentration of Interval 6-specinc antibodies in the 0.5X 
CI A prep was estimated to be 10-15 (estimated using the method described in 

lixample 15). 
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Hie inclusion body preparation (insoluble Interval 6 protein: pPAI870-2680(H)) and 
the soluble Interval 6 protein (pMA 1 870-2680: see Figure 15] were both compared for their 
ability to bind to neutralizing antibodies against C. dijfiak toxin A (CTA). Specifically. I to 
2 mg of recombinant protein was mixed with 5 ml of a 0.5X CTA antibody prep (estimated 
10 contain 60-70 ^g ol' Interval 6-specillc amibody). After incubation for I hr at 37»C. CTA 
(Tech l.ab) at a llnal concentration of 30 Mg/ml was added and incubated for another I hr at 
37T. One ml of this mixture containing 30 Mg of toxin A (and 10-15 ng of interval 6- 
specilic antib,)dy) was administered orally to 40-50 g Golden Syrian hamsters (Sasco). 
Recombinant proteins that result in the loss of neutralizing capacity of the CTA antibody 
would indicate that those proteins contain neutralizing epitopes. Preimmune and CTA 
antibodies (both at 0.5X) without the addition of any recombinant protein served as negative 
and positive controls, respectively. 

Two other inclusion btuly preparations, both expressed as insoluble products in the 
pl-T vector, were tested: one comaining a different insert (toxin B fragment) other than 
Interval (, called pI>B 1 8.S0-2070 (see Figure 18) which serves as a control for insoluble 
Interval (>. the other was a truncated version of the Interval 6 region called pPA 1 870-2 1 90 
(.sec l- igurc I5B). The results of this experiment arc shown in Table 21. 

TABLE 21 



Trcninieni Cimup' 


Healthy- 


^. ...... ..1I,IUT \^tll(.\MII^- 

Diiirrhca' 


/\in.T nours 
Dead' 


IVcimiiuine Ah 


0 






C TA Ah 


4 


1 


0 


C*TA Ah - \\^\ o (.suluhlc) 


I 


2 




CTA Ah ^ hil 6 (iMsoluhlc) 


5 


0 


0 


CTA Ah pPB 1850-2070 


5 


0 


0 


CTA Ah ^ pPA 1870-2 190 


5 


0 


0 



\ C. Jilliak- loxin A (CTA) wn.s added lo each group. 

Animals showed no signs of illness. 
■ Animals developed diarrhea but did noi die. 

Animals developed diarrhea and died. 

Prcimmiinc antibodx- was ineffective against toxin A. while anti-C TA antibodies at a 
dilute 0.5X concentration almost completely protected the hamsters against the enterotoxic 
cllects of CTA. The addition of recombinam proteins pPB 1850-2070 or pPA 1 870-2 190 lo 
the anti-(TA antibody had no effect upon its protective ability, indicating that these 
recombinam proteins do not bind to neutralizing antibodies. On the other hand, recombinant 
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Interval 6 protein was able to bind to neutralizing anti-CTA antibodies and neutralized the in 
v/vn protective effect of the anti-CTA antibodies. Tour out of five animals in the group 
treated with anti-CTA antibodies mixed with soluble Interval 6 protein exhibited toxin 
associated toxicity (diarrhea and death). Moreover, the results showed that Interval 6 protein 
miisi be expressed as a soluble product in order for it to bind to neutralizing anii-C I A 
antibodies since the addition of insoluble Interval 6 protein had no effect on the neutralizing 
capaciiN ol' the CTA antibody prep. 

c) Determination Of Neutralization Ability Of Antibodies 
Kiiiscd Against Soluble And Insoluble l oxin A Uepeat 
Immunogcn 

To determine il^ieuiralizing antibodies are induced against solubilizcd inclusion 
bt)dics. insoluble toxin A repeal proiein was .solubilizcd and specific antibodies were rai.sed in 
chickens. Insoluble pPA 1 870-2680 protein was solubili/cd using the method described in 
Williams cf ul. (1995), supra, Briclly, induced cultures (500 mh were pelleted by 
ccntriruuaiion at 3.000 X g for 10 min at 4"(\ Ihc cell pellets were resuspendcd thoroughly 
in 10 ml of inclusion body sonication buffer (25 mM HI:PES pM 7.7. 100 mM KCI. 12.5 mM 
MgC'l. 20% glycerol. 0.1% (v/v) Nonidct P-4(). 1 mM DTT). The suspension was transferred 
to a 30 ml non-glass centrifuge tube. I'ivc hundred [{\ of 10 mg/ml lysozyme was added and 
the tubes were incubated on ice for 30 min. fhe suspension was then iVozen at -70''C for at 
least 1 hr. I he suspension was thawed rapidly in a water bath at room temperature and then 
placed on ice. The suspension was then sonicated using at Iciisi eight 15 .sec bursts of the 
microprobe (liran.son Sonicator Model No. 450) with intermittent cooling on ice. 

riic sonicated suspension was transferred to a 35 ml Oakridge lube and centrifuged at 
6.000 X g for 10 min at 4*^ to pellet the inclusion bodies. The pellet was washed 2 limes by 
pipeuini: or voriexing in fresh, ice-cold RlPA buffer |0.l% SDS. 1% Triton X-IOO. 1% 
sodium deoxycholate in TBS (25 mM Tris-Cl pH 7.5. 150 mM Na(:i)|. The inclusion bodies 
were recentrifuged after each wash. The inclusion bodies were dried and transferred using a 
.small metal spatula to a 15 ml tube (l-aleon). One ml of 10% SDS was added and the pellet 
was .solubilizcd by gently pipetting the solutit>n up and down using a 1 ml niicropipettor. The 
solubilization was facilitated by heating the sample to 95**C' when necessary. 

Once the inclusion bodies were in solution, the samples were diluted with 0 volumes 
of PBS. rhe protein solutions were dialyzed overnight against a lOO-fold volume of PBS 
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containing 0.05% SDS at room temperature. The dialysis buffer was then ciianged to PBS 
containing 0.01% SDS and the samples were dialyzed for several hours to overnight at room 
temperature. The samples were stored at A'C until used. Prior to further use. the samples 
were warmed to room temperature to allow any precipitated SOS to go back into solution. 

The inclusion body preparation was used to immunize hens. The protein was dialyzed 
into PBS and emulsified with approximately equal volumes of CFA for the initial 
immunization or IKA for subscquem booster immunizations. On day zero, for each of the 
recombinant recombinant preparations, two egg laying white Leghorn hens were each injected 
at multiple sites (IM and SO with I ml of recombinant protein-adjuvant mi.xture containing 
appro.ximaiely 0.5-1.5 mg of recombinant protein. Booster immunizations of 1.0 mg were 
given of da>s 14 and day 28. l-ggs were collected on day 32 and the amibody isolated using 
PliCi as described in L-xampIe 14(a). High titers of toxin A spceillc antibodies were present^ 
las as.sa>ed h> I-:LISA. using the method described in F.xample l.l). Titers were determined 
for both antibodies again.st recombinant polypeptides pPA 1 870-2680 and pMA 1870-2680 and 
were f«)und to be comparable at > 1 :62.500. 

Antibodies again.st soluble Inierval 6 (pMA 1 870-2680) and insoluble Interval 6 
KiiKlusion body). pPAI870.2680| were tested for neutralizing ability against toxm A using 
the /// 1-/.V, tissa\ described in lixample 15(h). I'reininiunc antibodies and antibodies again.st 
toxin A (CTA) served as negative and positive controls. respeciivel>-. The results are shown 
in Table 22. 

TABLE 22 



Aiuibod) 1 reaijiieni Group 


Heaitlu' 


Diarriica' 


xntiM /Xlltl 1 lOlirS 

Dead' 


I'reinimunc 


1 


0 




CTA 


5 


(} 


(1 


Interval 6 (Suliibje)' 


5 


0 


0 


Inierval 6 (Insoluble) 


(J 


•» 


.» 



Animals .showed no .sign of illnes.s. 
Animal developed diarrhea Ixii did nni die. 
Anininl developed dinrrhca and died. 
•JOO lie ml. 

Amibodies raised against native toxin A were protective while preimmune antibodies 
had little effect. As found using the in vitro CHO a.ssay [described in Example X(d)| where 
amib(,dies rai.sed against the soluble Interval 6 could partially neutralize the effects of toxin A. 
here they uere able to c(»mpleicly neutralize toxin A //; vim. In contra.st. the antibodies 
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raised againsl the insoluble Interval 6 was unable to neutralize the elTccls of loxin A in vivo 
as shown above (Table 22) and in vitro as shown in the CHO assay [described in Example 

mi 

These results demonstrate that soluble toxin A repeat immunogen is necessary to 
5 induce the production of neutralizing antibodies in chickens, and that ihc generation ol'such 
soluble immunogen is obtained only with a specific expression vector (pMal) containing the 
loxin /\ region spanning aa 1870-2680. That is to say, insoluble protein that is subsequently 
solubilizcd does not result in a toxin A antigen that will elicit a neutralizing antibody. 

10 EXAMPLE 18 

Cloning And l:\pression Of The ( ' Ji/ficilc Town B ( icne 

1 he (oxin B uciic has been cloned and sequenced: the amino acid sequence deduced 
tVom the cloned niiclcolidc sequence predicts a MW of 269.7 kO lor loxin B IBarroso cf a/.. 

15 NlicI. Acids Res. 18:4004 (IWO)j. fhe nucleotide sequence of the coding reyion ol' ihc entire 
loxin B j^cnc is listed in SEO ID N():9. The amino acid sequence «)f ihe entire loxin B 
protein is listed in S[{Q ID NO: 10. The amino acid sequence consisting of amino acid 
residues 1850 ihrousih 2360 of loxin B is listed in Sl-Q li) NO: I 1. I he amino acid sequence 
consisting of amino acid residues 1750 through 2360 of toxin B is listed in Sl-Q ID NO:12. 

20 Ciiven ihe expense and difficulty of isolating native loxin B protein, ii would be 

advaniageous to use simple and inexpensive procaryotic expression systems lo produce and 
purify high le\els of recombinant toxin B protein for immunization purposes. Ideally, the 
isolated recombinant protein would be soluble in order to preserve native antigenicity, since 
solubili/ed inclusion body proteins often do not fold into native conformations. Indeed as 

25 shown in Example 17. neutralizing antibodies against recombinant toxin A were only obtained 
when soluble recombinant toxin A polypeptides were used as the immunogen. To allow ease 
of purification, the recombinant protein .should be expressed to levels greater than 1 mg/liter 
of coli culture. 

fo determine whether high levels ol' recombinant toxin tt protein could be produced in 
30 /:. ioli. fragments of the toxin Ii gene were cloned into various prokaryotic expression 

Ncctors. and assessed for the ability to express recombinani toxin B protein in li. coli. This 
Hxample in\'olved (a) cloning of the toxin B gene and (b) expression of the toxin B gene in 
/:. coli. 
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a) Cloning Of The Toxin B Gene 

The loNin B gene was cloned using PCR amplification from C difficile genomic DNA. 
Initially, ilie gene was cloned in two overlapping fragments, using primer pairs P5/P6 and 
P7/P8. The location of these primers along the toxin B gene is shown schematically in Figure 
18. The sequence of each of these primers is: P5: 5* TAGAAAAAATCjGCAAATGT 3' (SBQ 
II) NO: 1 1 ): P6: 5' TTTCATCTTGTA GAGTCAAAG X (SEQ ID N0:I2): 
P7: y CiAKJCX-^ACAAGATGATTTAGTG .V (SEQ ID NO: 13): and 
P8: 5- CTAATTGAUCTGTATCAGGATC V (SEQ ID N0:14). 

l- igure 18 also shows the location of the following primers along the toxin B gene: P9 
which consists of the sequence 5" CGGAATTCCTAGAAAAAA TGGCAA ATG 3" (SEQ ID 
NO: 1 5): PIO which consists of the sequence 5' GCTCTAGAATGA CCATAACiCTAGCCA 
3' (SEQ ID N():16); PI I which consists of the sequence 

5- C (i(iAATT(;GAGTT(jGTAGAAAGGTGGA 3' (SEQ ID NO:l7): P13 which consists of 
the sequence 5' ('(iCiAATrCGGTTATTATCTrAAClGATG 3' (SEQ ID NO:18): and PI4 
which consists of the sequence ,V CXJGAATTCTTGATAACTGGAT TI GTGAC 3* (SKQ ID 
N():l*>). I he amino acid sequence consisting of amino acid residues 1X52 through 2362 of 
toxin B is listed in SEQ ID NO:20. The amino acid sequence consisting of amino acid 
residues 1755 through 2362 of toxin B is listed in SEQ ID N0:2L 

ilosnidium iliffitilv VPI strain 10463 was obtained from the American I'vpc Culture 
Collection (ATCC 43255) and grown under anaerobic conditions in brain-hcarl infusion 
medium (Becion Dickinson). High molecular-weight C. i/if/icilc DNA was isolated essentially 
as described (Wren and Tabaqchali (1^)87) J. Clin. Microbiol.. 25:24021. except 1) 100 ^ig/ml 
proteinase K in 0.5% SDS was used to disrupt the bacteria and 2) cciytrimcthx lammonium 
bromide (CTAB) precipitation jas described by Ausubel cf uL. Eds.. Cuneni Proiocols in 
Molecular Bioloiry^ Vol. 2 (1989) (Airrent Prolocolsl was used to remove carbohydrates from 
the cleared lysaie. Briclly. after disruption of the bacteria with proteinase K and SDS, the 
solution is adjusted to approximately 0.7 M NaCl by the addition ol a 1/7 volume of 5M 
NaCl. A 1/10 xolume of CTAB/NaCI (10% C I AB in 0.7 M NaCl) solution was added and 
the solution was mixed thoroughly and incubated 10 min at 65°C. An equal volume ol* 
chloroform/i.soaniyl alcohol (24:1) was added and the phases were thoroughly mixed. The 
organic and aqueous pha.scs were separated by centrifugal ion in a microfuge for 5 min. The 
aqueous supernatant was removed and extracted with phenol/chloroform/ isoamyl alcohol 
(25:24:1 ). The phases were separated by centrifugation in a microfuge for 5 min. The 
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superriatani was transferred to a JVesh lube and the DNA was precipitated with isopropanol. 
The DNA precipiiaic was pelleted by brief ccntrifugaiion in a microfugc. The DNA pellet 
was washed with 70% elhanol to remove residual CTA13. The DNA pellet was then dried 
and redissolved in TH buffer (10 mM Tris-HCI pH8.0. I mM L-DTA). The inieuriiy and 
yield of genomic DNA was assessed by comparison with a serial dilution of uncut lambda 
DNA after electrophoresis on an agarose gel. 

Toxin B fragments were cloned by PCR utilizing a proofreading thermostable DNA 
polymerase | native /'/// polymerase (Straiagene)|. The high ndelity of this polymerase 
reduces the mutation problems associated with amplification by error prone polymerases (l> 
Taq polymerase). PCR amplification was performed using the VCR primer pairs l>5 (SliQ ID 
NO:l I ) with P6 (S[:g ID NO:l2) and P7 (SKQ ID N0:I3) with P8 (SHQ ID NO:14) in 50 
III reactions containing 10 mM Tris-llCI pH8.3. 50 mM KCl 1.5 mM MgC'l. 200 ^iM of 
each d\'TP. 0.2 ^iM each primer, and 50 ng iliffuilc genomic DNA. Reactions were 
overlaid with 100 pi mineral oil. heated to ^M°C I'or 4 min. ().5mI native rfu polymerase 
(Stratagene) was added, and the reactions were cycled 30 times at ^)4°(' for 1 min. 50^(^ fcir I 
min. 72"C (2 min for each kb of sequence to be amplified), followed by 10 min at 72°C;. 
Duplicate reactions were pooled, chloroform extracted, and elhanol precipitated. After 
washing in 70% elhanol. the pellets were resuspendcd in 50 ^il TI-: buffer (10 niM Tri.s-tlCI 
pH8.0. 1 mM nDTA). 

The P5/P6 amplification producl was cloned into pl.l(M9 as outlined below. lO^il 
aliquois of DNA were gel purified using the Prep-a-(iene kit (BioRad). and ligated to Snut\ 
restricted plJC'M) vector. Recombinant clones were isolated and eonllrnied by restriction 
digestion using standard recombinant molecular biology techniques (.Sambrook c/ ai, 1989). 
Inserts IVom two independent isolates were identified in which the toxin H insert was oriented 
such that the vector lktm\\\ and Suc\ sites were 5' and 3* oriented, respectively (plJCBIO- 
1530). I he insert-containing Dumlil/Sac/ fragment was cloned into similarly cut pl:'r23a-c 
Nceior DNA. and protein expression was induced in small scale cultures (5 ml) of identified 
clones. 

fotal protein extracts were isolated, resolved on SD.S-PACil£ gels, and toxin B protein 
identified by Western analy.sis utilizing a goat anti-toxin alfinily purified antibody (Tech 
Lab). Procedures for protein induction. SDS-PAGl:. and Western blot analysis were 
performed as described in Williams ci a/. (IW5). supra. In brief, 5 ml cultures of bacteria 
grown in 2XYT containing 100 Mg/ml ampicillin containing the appropriate recombinant clone 
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were induced to express recombinant protein by addition oJ IPTG to ImM. The E. coli hosts 
used were: BL2KDE3) or BL21(DE3)LysS (Novagen) for pET plasmids. 

Cultures were induced by the addition oflPTG lo a final concentration of 1.0 mM 
when the cell density reached 0.5 OD^^, and induced protein was allowed to accumulate for 
two hrs after induction. Protein samples were prepared b\- pelleting 1 ml aliquots of bacteria 
by ccnirifugation ( 1 min in microfuge). and resuspension of the pelleted bacteria in 150 ^1 of 
2X SDS-PAGC sample buffer (0.125 mM Tris-HCI pH 6.8. 2 mM EDTA. 6% SDS, 20% 
glycerol. 0.035% bromophenol blue: (i-mercaptoethanol is added to 5% before use). The 
samples were healed to QS^C for 5 min. then cooled and 5 or 10 ^ils loaded on 7.5% SDS- 
PACJC gcLs. High molecular weight protein markers (BioRad) were also loaded, to allow 
estimation of the MW of identified fusion proteins. After electrophoresis, protein was 
detected cither generally by staining the gels with Coomassie Blue, or specifically, by blotting 
u» nitroccllukwc for Western blot detection of specific immunoreactive protein. I he MW of 
induced toxin \^ reactive protein allowed the integrity of the toxin H reading (ranie to be 
determined. 

The pi:i'23b recombinant (pPBIO-1530) expressed high MW recombinant loxin B 
reactive protein, consistent with predicted values. This confirmed that frame terminating 
errors had not occurred during I'CR amplification. A pET23b expression clone containing the 
IO-1750aa interval of the loxin B gene was constructed, by fusion of the EcoRV-Spi'] 
fragment of the P7/P8 amplification product to the interval of the P5/P6 

amplification product (see I'tgure 18) in pl'BIO-1530. The integrity of this clone (pPBlO- 
1750) was confirmed by restriction mapping, and Western blot detection ol" expressed 
recombinant toxin B protein. Levels of induced protein fiom both pPBlO-IS.'O and pPBlO- 
I7?() were too low to facilitate purification of usable amounts oC recombinant toxin B protein, 
flic remaining 1750-2360 aa interval was directly cloned into pMAL or pHT expression 
vectors from the P7/P8 amplification product as described below. 

b) li:.\prcssiun Of The Toxin B Gene 

i) Overview Of Expre.ssion (Methodologies 

f ragments of the toxin B gene were expressed as either native t)r fusion proteins in E. 
coli. Native proteins were expressed in either the pl:T23a-c or pET16b expression vectors 
(Novagen). The pi:T23 vectors contain an extensive poly linker sequence in all three reading 
frames (a-c vectors) followed by a C-terminal poly-histidine repeat. The pETI6b vector 
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contains a N-terminal poly-hisiidine sequence immediately 5" to a small polylinker. The 
poly-liisiidine sequence binds to Ni-Chelale columns and allows affinity purification of tagged 
target proteins [Williams el uL (1995), supral These affinity tags arc small ( 10 aa for 
pETI6b. 6 aa for pHT23) allowing the expression and affinity purification of native proteins 
with only limited amounts of foreign sequences. 

An N-icrminal histidine-lagged derivative of pE 11 6b containing an extensive cloning 
cassette was constructed lo facilitate cloning of N-terminal poly-hisiidinc lagged loxin B 
expressing constructs. This was accomplished by replacemeni of the promoter region of the 
pl-T23u and b vectors with that of the pET16b expression vector. Each vector was restricted 
with BklW and AWt'L and the reactions resolved on a 1.2 % agarose gel. The pET16b 
promoter region (contained in a 200 bp B^il\\-Ndc\ fragment) and the promoter-lcss pET23 a 
or b vectors were cut from the gel. mixed and Prcp-A-(icne (BioRad) purified. The eluled 
DNA uas ligaied. and translbrmants screened Ibr promoter replacement by hco\ digestion of 
purified plasmid DNA (the pET16b promoter contains ihis siie. ihe pEr23 promoter does 
not), riicse clones (denoted pETI Iisa or b) contain the pET16b promoter (consisting of a 
pT7-lac promoter, ribosome binding site and poly-histidine (lOaa) sequence) fused at the NUa\ 
site to the extensive pET23 polylinker. 

All MliP llision proteins were constructed and expressed in the pMAl.'^'-c or 
pMAI.'^^-p2 N'cciors (New England Biolabs) in which the protein of interest is expressed as a 
C-ierminal fusion with MBP. All pET plasmids were expressed in either the B1.2l(I)Er>) or 
BL21(I)E3)I.ysS expression hosl.s, while pMal plasmids were expressed in the BE2I host. 

Larue scale (500 mis lo I liter) cultures of each recombinant were grown in 2X Y*I' 
broth, induced, and soluble protein J'ractions were isolated as described |Williams. el al. 
(1995). supral The .soluble protein extracts were affinity chromatographed to isolate 
recombinant fusion protein, as described [Williams e/ al.. (1995) xupral In brief, extracts 
containing tagged pET fusions were chromatographed on a nickel chelate column, and eluted 
using imidazole sails or low pll (pll 4.0) as described by the di.stributor (Novagcn or Qiagen). 
Extracts containing soluble pMAl. fusion protein were prepared and chromatographed in PBS 
buffer o\er an amylose resin (New England Biolabs) column, and eluted with PBS containing 
10 niM maltose as described |Williams ei al. (1995). .vz//7/y/|. 
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ii) Overview Of Toxin B Expression 

In both laiyc expression constructs described in (a) above, only low level (/.c. less 
than I mg/liter of intact or nondeyradcd recombinant protein) expression of recombinant 
protein was detected. A number of expression constructs containing smaller fragments of the 
toxin B gene were then constructed, to determine If small regions of the gene can be 
expressed to high levels {i.e.. greater than I mg/liter intact protein) without extensive protein 
degradation. .All were constructed by in frame fusions of convenient toxin B restriction 
fragments to either the pMAL-c. pliT23a-c, pHTI6b or pETIlisa-b expression vectors, or by 
engineering restriction sites at specific locations using PCR amplitlcation [using the same 
conditions described in (a) above). In all cases, clones were veriHed by restriction mapping, 
and. where indicated. DNA .sequencing. 

I'rolcin preparations from induced cultures ol" each of these constructs were analyzed, 
by SDS-l>A(ii;. to estimate protein .stability (Coomassic Blue siaining) and immunorcactivity 
against anti-toxin B specific amiserum (Western analy.sis). Higher levels of intact (i.e.. 
nondegraded). full length fusion proteins were observed with the smaller constructs as 
compared with the larger recombinants, and a series of expression constructs .spanning the 
entire toxin B gene were constructed (l-iuures 18. 19 and 20 and Table 2.1). 

Constructs that expressed signillcant levels of recombinant toxin B protein (greater 
than I mg/liicr intact recombinant protein) in /T, coli were idcntitied and a series of these 
clones thai spans the toxin B gene are shown in Figure 19 and summarized in Table 23. 
These clones svcre utilized to isolate pure toxin B recombinant protein from the entire toxin B 
gene. Significant protein yields uere obtained from piMAL expression constructs spanning the 
entire toxin B gene, and yields of full length soluble fusion protein ranged from an estimated 
I mg/liter culture (pMBI 1 00- 1 530) to greater than 20 mg/liter culture (pMB 1750-2360). 

Representative purifications of MBP and poly-histidine-tagged toxin B recombinants 
are shown in l-igures 21 and 22. Figure 21 shows a Cooma.ssle Blue stained 7.5% SDS- 
PACL: gel on which \ arious protein .samples extracted from bacteria harboring pMBI850- 
2360 were electrophoresed. Samples were loaded as follows: Lane 1: protein extracted from 
uninduced culture: Lane 2: induced culture protein: Lane 3: total protein from induced culture 
after sonicatlon: Lane 4: soluble protein: and Lane 5: eluted alllnliy purified protein. Figure 
22 depicts the purification of recombinant proteins expressed in bacteria harboring either 
pPBl 850-2360 (Lanes 1-3) or pPB 1750-2360 (Lanes 4-6). Samples were loaded as follows: 
uninduced total protein (Lanes I and 4): induced total protein (Lanes 2 and 5): and cluted 
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affinity purified proiein (Lanes 3 and 6). The broad range molecular weight protein markers 
(BioRad) are shown in Lane 7. 

Thus, although high level expression was not attained using large expression constructs 
from the toxin B gene, usable levels of recombinant protein were obtained by isolating 
induced protein from a scries of smaller pMAL expression constructs that span the entire 
toxin B gene. 

These results represent the first demonstration of the feasibility of expressing 
recombinant toxin B proiein to high levels in E, coli. As well, expression ol small regions of 
the puiaiive ligand binding domain (repeat region) ot loxin B as native protein yielded 
insoluble protein, while large constructs, or fusions to MBP were soluble (l-igure 19), 
demonstrating that specific methodologies arc necessary lo produce soluble fusion proiein 
from this interval. 

iii) Clone Construction And Expression Details 

A portion of the loxin B gene containing the loxin B repeal region | amino acid 
residues 1852-2362 of toxin B (SHQ ID NO:20)| was isolated by PCR ampliUcaiion of this 
inier\al of the loxin B gene from C. Jiffwilc genomic DNA. The sequence, and location 
within ihe loxin B gene, of the two PCR primers |P7 (SEQ ID NO: 13) and PS (SHQ ID 
NO: 1 4)1 used lo amplify this region are shown in Pigurc 18. 

DNA from the PCR amplification was purified by chloroform extraction and eihanol 
precipitation as described above. The DNA was restricted with Spc\. and the clea\ed DNA 
was rcsohed by agarose gel elecirophoresis. The restriction digc.siitMi with Spcl cleaved the 
3.6 kb amplification product into a 1.8 kb doublet band, fhis doublet band was cut from the 
gel and mixed with appropriately cut. gel purified pMALc or pFT23b vector. I hese vectors 
were prepared by digestion with /tindllL filling in the overhanging ends using the Klenow 
enzyme, and cleaving with Xha\ (pMALc) or Nhd (pET23b). The gel purified DNA 
fragments were purified using the Prep-A-Gene kit (BioRad) and the DNA was ligaied. 
transformed and putative recombinant clones analyzed by restriction mapping. 

pli'f and pMal clones containing the toxin B repeat insert (aa interval l7.S()-236() of 
toxin B) were verified by restriction mapping, using enzymes that cleaved specific sites within 
the loxin B region. In both cases fusion of the toxin B Spcl site with either the compatible 
Xha\ site (pMal) or compatible Xhcl site (pET) is predicted to creale an in frame fusion, fhis 
was confirmed in the case of the pMB 1 750-2360 clone by DNA sequencing of the clone 
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junction and 5' end of the toxin B insert using a MBP specific primer (New England 
Blolabs). In tiic case of the pET construct, the fusion of the blunt ended toxin B 3* end to 
the filled HintAU site should create an in-frame fusion with the C-lcrminal poly-histidine 
sequence in this vector. The pPB 1 750-2360 clone selected had lost, as predicted, the Hmctlll 
sue at this clone junction: this eliminated the possibility that an additional adenosine residue 
was added to the 3" end of the PCR product by a terminal transferase activity of the Pfn 
polymerase, since fusion of this adenosine residue to the filled HintnU site would regenerate 
the restriction site (and was observed in several clones). 

One liter cultures of each expression construct were grown, and fusion protein purified 
by affinity chromatography on either an amylose resin column (pMAL consimcts: resin 
supplied by New iingland Biolabs) or Ni-chelatc column (pET constructs: resin supplied by 
giagen or Novagen) as described [Williams a al. (1995), .wpral The integrity and purity of 
the fusion proteins were determined by Coomassie staining of SDS-PACrH protein gels as well 
as Western blot analysis with either an affinity purified goat polyclonal antiserum (Tech Lab), 
or a chicken poiycU)nal PUG prep, raised against the toxin protein (CTB) as described 
above m Mxample 8. In both cases, affinity purification resulted in yields in excess of 20 mu 
protein per liter culture, of which greater than 90% was estimated to be full-length 
recombinant protein. Il should he noted that the poly-hisiidine affinity tagged protein was 
released from the Qiagen Ni>NTA resin at low imidazole concentration (60 mM). 
necessitating the use of a 40 mM imidazole rather than a 60 mM imidazole wash step during 
purification. 

A peripla.smically secreted \ ersion of pMB 1 750-2360 was constructed by replacement 
^ ol the promoter and MBP coding region of this construct with thai from a related vector 
(pMAl.'"-p2: New Iingland Biolabs) in which a signal sequence is present al the N-terminiis 
of the MBP. such that fusion protein is exported. This was accomplished by substituting a 
Di^lW^EcoRV piomoter fragment from pMAL-p2 into pMB 1750-2360. The yields of secreted, 
affinity purified protein (recovered from o.smotic shock extracts as described by Riggs in 
( itrrcfv nroKKols in Molecular lUoln\^\\ Vol. 2. Ausubel. tv ui. lids. (1989). Current 
Proiocohs. pp. 16.6.1 - 16.6.14] from this vector (pMBpl 750-2360) were 6.5 mg/liier culture, 
of which 50% was estimated to he full-length fusion protein. 

The interval was also expressed in two non-overlapping fragments. pMBl 750- 1970 
was constructed by introduction of a frameshift into pMB 1 750-2360. by restriction with 
IfinMl. filling in the overhanging ends and religation of the plasmid. Recombinant clones 
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were selected by loss of the HinMl site, and further restriction map analysis. Recombinant 
protein expression from this vector was more than 20 mg/liicr of greater than 90% pure 
protein. 

The complementary region wius expressed in pMB 1970-2360. This construct was 
created by removal of the 1750-1970 interval of pMB 1 750-2360. This was accomplished by 
restriction of this plasmid with EcoRl (in the pMalc polylinker 5* lo the insert) and III. fiiling 
in the overhanging ends, and rcliuaiion of the plasmid. The resuliani plasmid, pMB 1 970-2360, 
was made using both intracellularly and secreted versions of the pMU 1 750-2360 vector. 

No fusion protein was secreted in the pMBpl 970-2360 version, perhaps due to a 
conformational constraint that prevents export of the fusion protein. However, the 
intracellularly expressed vector produced greater than 40mg/iiier of greater than 90% full- 
length fu.sion protein. 

Constructs to precisely express the toxin B repeats in either pMalc (pMB 1850-2360) or 
pl';T16b (pPBl 850-2360) were constructed as follows. The DNA interviil including the toxin 
B repeats was ?CR amplified as described above utilizing PCR primers fM4 (SKQ II) NO:19) 
and PX (SKQ ID NO: 14). Primer P14 adds a /-xoRl site immediately Hanking the start of the 
toxin H repeals. 

The amplified fragment was cloned into the pr7 Blue l -veclor (Novagen) and 
recombinant clones in which single copies of the PCR fragment were inserted in either 
orientation were selected (p 171 850-2360) and confirmed by restriction mapping. Fhe insert 
was excised from two appropriately oriented independently isolated p 17 1850-2360 plasmids. 
with /:o>Rl (5* end of repeats) and P.\t\ (in the Hanking polylinker of the vector), and cloned 
into HloIWPs/] cleaved pMalc vector. The resulting construct (pMB 1 850-2360) was 
conllrmed b> restriction analysis, and yielded 20 mg/l of soluble fusion protein [greater than 
90% intact {i.e.. nondegraded)) after affinity chromatography. Restriction of this plasmid with 
HhnnU and religaiion of the vector resulted in the removal of the 1970-2360 interval. The 
resultant construct (pMBI850-l970) expressed greater than 70 mg/liter of 90% full length 
fusion protein. 
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The pPB 1850-2360 construct was made by cloning a EcoRl (filled with Klenow)- 
BumHl fragment from a pT7 1 850-2360 vector (opposite orientation to that used in the 
pMBI 850-2360 construction) into Mel (filled)/5amHI cleaved pETl6b vector. Yields of 
airmity purified soluble fusion protein were 15 mg/liter. of greater than 90% full length 
fusion protein. 

Several smaller expression constructs from the 1750-2070 interval were also 
constructed in Hi.s-iagged pET vectors, but expression of these plasmids in the Bi.2l (DE3) 
hcsi resulted in the production of high levels of mostly insoluble protein (see Table 23 and 
Figure 19). These constructs were made as follows. 

pPB 1850- 1970 was constructed by cloning a BglW-HimnW fragment of pPB 1850-2360 
into BnlWHhunU cleaved pET23b vector. pPBI 850-2070 was constructed by cloning a 
im-rviiU fragment of pPB1850-2360 into BiilWHmcW cleaved pET23b vector. pPBI750- 
1970(c) was constructed by removal of the internal HintnW fragment of a pPB 1750-2360 
M-ctor in which the vector Hmd\\\ site was regenerated during cloning (presumably by the 
addition of an A residue to the amplified PCR product b> lerminal transferase activity of PJ„ 
polymerase). The pPBI750-l970(ii) construct wa.s made by insertion of the insert containing 
the A*/c'l-///,K/lI| fragment of pPB 1 750-2360 into idemically cleaved pETHisb vector. All 
constructs were confirmed by restriction digestion. 

An expression construct that directs expression of tiie 10-470 aa interval of toxin B 
was con.structed in the piVlalc vector as follows. A NhL'\ (a site 5' to the in.serl in the pI-:T23 
vector (-.4//li (lllled) fragmem of the to.xin B gene from pPBn)-l530 was cloned into Xhci\ 
(compatible with MreWHhum (lllled) pMalc vector. The integrity of the construct (pMBIO- 
470) was verilied by restriction mapping and DNA sequencing of the 5' clone junction using 
a MBP specific DNA primer (New England Biolabs). However, all expressed protein was 
degraded to the MBP monomer MW. 

A second construct spanning this interval (aa 10-470) was constructed by cloning the 
PCR anipiification product from a reaction containing the P9 (SEQ ID NO: 15) and P 10 (SEQ 
li> NO: 16) primers (Figure 18) into the pETHisa vector. This was accomplished by cloning 
the PCR product as an AcviRI-blunt fragment into EcoKX-HincW restricted vector DNA: 
recombinant clones were verified by restriction mapping. Although this construct (pPBIO- 
520) allowed expression and purification (utilizing the N-terminal polyhistidine affinity tag) of 
intact liision protein, yields were estimated at less than 500 ng per liter culture. 
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Higher yield of recombinant protein from this interval (aa 10-520) were obtained by 
expression of the interval in two overlapping clones. The IO-330aa interval was cloned in 
both pKTHisa and pMalc vectors, using the BamHl-AfllU (filled) DNA fragment from pPBIO- 
520. This fragment was cloned into Bamlil-Hindlll (filled) restricted pMalc or BamHl-HincM 
5 restricted pETHisa vector. Recombinant clones were verified by restriction mapping. High 
level expression of either insoluble (pliT) or soluble (pMal) fusion protein was obtained. Total 
yields of fusion proiein from the pMBlO-330 construct (Figure IS) were 20 mg/litcr culture, 
of which 10% was estimated to be full-length fusion protein. .Although yields of this interval 
were higher and >W% lull-length recombinant protein produced when expressed from the 

10 pET construct, the pMal fusion was utilized since the expressed protein was soluble and thus 
more likely to have the native conformation. 

The pMB260-520 clone was constructed by cloning luoRl-Xhal cleaved amplified 
DNA from n PCR reaction containing the PIl (S[:0 IH N():I7) and PIO (SFQ ID N():16) 
DNA primers (Figure 18) into similarly restricted pMalc vector. Yields of affinity purified 

15 proiein were 10 mg/liter. of which approximately 50% was estimated to be full-length 
recombinant protein. 

riie aa5IO-l 1 10 interval was expressed as described below, fhis entire interval was 
expressed as a pMal fusion by cloning the Nhc\'llinJ\\\ fragment of pUCBIO-1530 into Xha\- 
llmtRW cleaved pMalc vector, fhe iniegruy of the construct (pMU5IO-l I 10) was verified by 

20 restriction mapping and DNA sequencing of the 5" clone junction using a MBP specific DNA 
primer. The yield oi' affinity purified protein was 25 mg/liter culture, of which 5% was 
estimated to he full-length fusion protein (1 mg/litcr). 

fo attempt to obtain higher yields, this region was expressed in two fragments (aa5IO- 
«20, and 820-1 1 10) in the pMalc vector. 1hc pMBS 10-820 clone was constructed by 

25 insertion of a SacX (in the pMalc polylinkcr 5* to the insert )-///?£/! DNA fragment from 
pMB510-ll 10 into Sacl/Siul restricted pMalc vector, fhe pMB820-! 1 10 vector was 
constructed by insertion of the Hpal'llindlU fragment of pllCBIO-1530 into liamHl (filled)/ 
/•//m/Ill cleaved pMalc vector, fhe integrity of these constructs were verified by restriction 
mapping and DNA sequencing of the 5* clone junction using a MBP specific DNA primer. 

30 Recombinant protein expressed from the pMB5 10-820 vector was highly unstable. 

However, high levels (20 mg/liier) of >y0% full-length fusion protein were obtained from the 
pMB820-! 105 construct. The combination of partially degraded pMB5IO-l 1 10 protein 
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(enriched for the 510-820 inlerval) with the pMB820-l 1 10 protein provides usable amounts of 
recombinant antigen from this inter\'al. 

Tiie aa MOO- 1 750 interval was expressed as described below. The entire interval was 
expressed in the pMalc vector from a construct in which the ^trl(nilcd)..S>t'I fragment of 
pl>BI0-l750 was inserted into Siul/Xhal (Xbal is compatible with Spcl: Sn,l and filled Accl 
sites arc both blunt ended) restricted pMalc. The integrity of this construct (pMBI 100-1750) 
was verilled by restriction mapping and DNA sequencing of the clone junction using a MBP 
specific DNA primer. Although 15 mg/liter of aOlniiy purilled protein was isolated from celLs 
harboring this construct, the protein was greater than 09% degraded to MBP monomer size. 

A smaller derivative of pMBI 100-1750 was constructed by restriction of pMBI 100- 
1750 with ,4//II and .SV//I (in the pMalc polylinker 3' to the insert), tilling in ihe overhanging 
ends, and religating the plasmid. The resultam clone (verified by restriction digestion and 
DNA sequencing) has deleted the aal5.-'>0-l750 interval, was de.signated pMB! IO0-15.-J0. 
pMBl 100-15.10 c.xpres.scd recombinant protein at a yield ol greater than 40 nig/liicr. of which 
5% was estimated to be full-length fusion protein. 

Three constructs were made to express the remaining interval. Initially, a DspHl 
(rilled)-.S/,H Iragmcm from pPBlO-1750 was cloned into £a>RUniled)/.VA«l cleaved pMalc 
vector. The integrity of this construct (pMBI57O-l750) was verified by restriction mapping 
and DNA .setiueneing of the 5" clone Junction using a MBP specific DNA primer, l-.xprcssion 
of recombinant protein from this plasmid was very low. appro.ximately .1 mg affinity purified 
protein per liter, and most was degraded to MBP monomer size. This region was 
subsequently cxprcs.scd from a PCR amplified DNA fragment. A PCR reaction utilizing 
primers PLl |.SEO ID N();I8: P13 was engineered to introduce an EcoRI .site 5" to amplified 
loNin 13 scciiiencesl and P8 (SHQ ID N():14) was performed on C. c/ijficik' genomic DNA as 
described above. The amplified fragment was cleaved with EoRl and Spd. and cloned into 
i:co\U/Xha\ cleaved pMalc vector. The resultant clone (pMB 1530- 1750) was verified by 
restriction map analysis, and recombinant protein was expressed and purified. The yield was 
greater than 20 .iig protein per liter culture and it was estimated that 25% was full-length 
fusion protein: this was a significantly higher yield than the original pMBl 570-1750 clone. 
The insert of pMB 1530- 1 750 (in a £c7,RI-.WI fragment) was transferred to the ptTIIisa 
N cctor lEcoRUXhol cleaved. Xho\ and Sail ends are compatible). No detectable fusion 
protein was purified on Ni-Chelate columns from soluble lysates of cells induced to express 
fusion protein from this construct. 
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TABLE 23 

Summary Of Toxin B Expression Constructs'' 



\^ iiinc 


Affinity Pag 


Yield (nig/liter) 


% I'ull Length 


pPBIO-1750 


none 


low (esiiniaicd from Western 
blot hyb.) 


•» 


pPBI()-l530 


none 


low (as above) 


»» 


pMB 10-470 


MBP 


I5mi!/I 


0% 


prn 


poly-his 


0.5 mg.' I 


20% 


pPH IO-3jO 


poly-his 


-20mg/l (insoluble) 


90% 


pM BIO' 3 30 


MBP 


20m\i/l 


fO% 


pM 8260-^20 


MBP 


lOm^/l 


50% 


.t, A J 13 i lit 1 1 1 §% 

pM/J^i / U-t f tu 


MBP 


25m^/l 


5% 


pMB5 10-820 


MBP 


deeraded (by Western blot 
)iyb) 






MBP 


20m\i.^l 


yr;% 


pMBI 100-1750 


MBP 


15mg/l 


0% 


pMBi 100-1530 


MBP 






pMB 1570- 1750 


MBP 


jmg/l 


• y/o 


pPBI 530- 1750 


poly-his 


no purified protein detected 


■1 


I}\mi530-r50 


MBP 


20mj^/l 


25% 


pxmr 50-2360 


MBP 


'20mji// 




pMBp 1750-3360 


MBP 


6.3mg/l (secreted) 


50% 


pI»B 1750-2360 


poly-his 


■20mg/l 


90% 


pMB 1750- 1970 


MBP 


'20mg/l 


•90% 


pMB 1 070-2360 


MBP 


40mg/l 


90% 


pM Bp 1*^70-2360 


MBP 


(no secretion) 


NA 


pMRI 850-2360 


MBP 


20mg/l 


90% 


pPR 1850-2360 


poly-his 


ISmg'l 


•90% 


pMBIK.M)-l07n 


MBP 


70m[i/l 


90% 


pPB1850-l970 


poly-his 


-lOmul (insnliibic) 


•90% 


pPB 1850-2070 


poly-his 


•lOmg/l (insoluble) 




pPB 1750- 1970(c) 


poly-his 


lOni^/l (insoluhlt;) 


<Hm 


pPB 1750- 1970(11) 


poly-his 


■ lOniu.'l ( insoluble) 


90% 



Clones in italics arc clones currently utilized to purify recombinant protein from each selected 
interval. 
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occurs with the CTB antlbody-recombinant mixture, that recombinant contains only weak or 
non-neutralizing epitopes of toxin B. This assay was performed as follows. 

Antibodies against CTB were generated in egg laying Leghorn hens as described in 
Kxample 8. The lethal dosage (LD of C. diflkile toxin B when delivered I.P. into 40g 
female Golden .Syrian hamsters (Charles River) was determined to be 2.5 to 5 ^ig. Antibodies 
generated against CTB and purified by PKG precipitation could completely protect the 
hamsters at the I.P. dosage determined above. The minimal amount of CTB antibody needed 
10 alford good protection against 5 Mg of CTB when injected I.P. into hamsters was also 
determined (IX PEG prep). These experiments defined the parameter.s needed to test wheiher 
a given recombinant protein could deplete protective CTB antibodies. 

The cloned regions tested for neutralizing ability cover the entire toxin B gene and 
were designated as Intervals (INT) I through 5 (see Figure I')). Approximately equivalent 
final concemrations of each recombinant polypeptide sveie tested. The following recombinant 
polypeptides were u.sed: I ) a mixture of intervals 1 and 2 (INT- 1. 2): 2) a mixture of 
Intervals 4 and 5 (lNT-4. 5) and 3) Interval .1 (INT-.l). Recombinant proteins (each at about 
I mg total protein) were llr.st preincubatcd with a final CTB antibody concentration of iX 
[i.e.. pellet disjiolved in original yolk volume as described in Example I(c)| in a final volume 
of 5 mis for I hour at 37°C. Twenty-live Mg of CTB (at a concentration of 5 ng/ml). enough 
CTB to kill 5 hamsters, was then added and the mixture was then incubated for I hour at 
.■i7°C. Five. 40g female hamsters (Charles River) in each treatment group were then each 
yivcn I ml of the mixture I.P. using a tuberculin syringe with a 27 gauge needle. The results 
of this experiment are shown in Table 24. 

TABLE 24 



Treaimen! Group' 


Number Of Animals Alive 


Number Of Animals Dead 


CTB aniibodies 


3 


2 


CTB annbodies - !NTI.2 




2 


CTB antibodies • INT4.5 




2 


CTB antibodies + INT 3 


0 


5 



C. dijfkih lo.xin B (CTB) was added to each group. 

As shown in Table 24. the addition of recombinant proteins from I NT- 1. 2 or INT-4, 5 
had no effect on the m vivo protective ability of the CTB antibody preparation compared to 
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EXAMPLE 19 

I denlifi cation. Purification And Induction Of Neutralizing 
Antibodies Against Recombinant C difficile Toxin B Protein 

' 5 1 0 determine whether recombinant toxin B polypeptide fragments can generate 

neutralizing antibodies, typically animals would first be immunized with recombinant proteins 
and anti-recombinani antibodies arc generated. These anli-recombinant protein antibodies are 
then tested for neutralizing ability //; vivo or in vitro. Depending on the immunogenic nature 
of the recombinant polypeptide, the generation of high-titer antibodies against that protein 

10 may take several months. To accelerate this process and identify which recombinant 

polypcplide(s) may be the best candidate to generate neutralizing antibodies, depletion studies 
were performed. Specifically, rccombinani toxin B polypeptide were prc-screened by testing 
whether they have the ability to bind to protective antibodies from a CTB antibody 
preparation and hence deplete those antibodies of their neutralizing capacity. Those 

15 recombinant polypeptides Ibund to bind CTB, were then utilized to generate neutralizing 

antibodies. This lixample involved: a) identification of recombinant sub-regions within toxin 
B to which neutralizing antibodies bind: b) identification of toxin B sub-region specific 
antibodies that neutralize toxin li in vivo: and e) generation and evaluation ol' antibodies 
reacti\e to recombinant toxin polypeptides. 

20 

ii) Identification Of Recombinant Sub-Regions Within Toxin B 
To Which Ncutrali/ing Antibodies Bind 

Sub-regions within toxin B to which neutralizing antibodies bind were identified by 
utilizing recombinant toxin B proteins to deplete protective antibodies from a polyclonal pool 

25 ot" antibodies against native ( " difficHc toxin B. An in v/w> assay was developed to evaluate 
protein preparations for the ability to bind neutralizing antibodies. Recombinant proteins were 
first pre-mixed with antibodies directed against native toxin B (CTB antibody: see Kxample 8) 
and allowed to react for one hour at 3TC. Subsequently. L\ ciifficilc toxin B (C TB; Tech 
I.ab) was added at a concentration lethal to hamsters and incubated for another hour at 37'*C. 

30 .M'ter incubation this mixture was injected intraperitoneally (IP) into hamsters. If the 

recombinant polypeptide contains neutralizing epitopes, the (TB antibodies will lose its 
ability to protect the hamsters against death from CTB. if partial or complete protection 
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the CTB antibody preparation alone. In contrast. INT-3 recombinant polypeptide was able to 
remove all of the toxin B neutralizing ability of the CTB antibodies as demonstrated by the 
death of all the hamsters in that group. 

The above experiment was repeated, using two smaller expressed fragments (pMB 
1750-1970 and pMB 1970-2.360. .sec Figure 19) comprising the lNT-3 domain to determine if 
that domain could he further subdivided into smaller neutralizing epitopes. In addition, full- 
length lNT-3 polypeptide expressed as a nickel tagged protein (pPB 1750-2360) was tested for 
neutralizing ability and compared to the original INT-3 expressed MBP fusion (pMB1750- 
2360). The results arc shown in Table 25. 

TABLE 25 



Treatment (iroup' 


Number Of Animals Alive 


Number Of Animals Dead 


C TB iintibodics 


5 


ri 


CTB anilbodies ♦ pPB 1750-2360 


0 


5 


CTB umibotiics t pMB 1750-2360 


0 


5 


CTB aniibodics * pMBI970-2360 


3 


"i 


CTB nmibodics pMB 1750-1970 


2 


3 



15 



20 



30 



( iiiUktic uixin B (CTB) was added to each group. 

The results summarized in Tabic 25 indicate that the smaller polypcpiide fragmenus 
within the INT-3 domain. pMBI750-l970 and pMB1970.2360. partially lose the abilily to 
bind to and remove neutralizing antibodies from the CTB antibody pool. These results 
demonstrate that the full length INT-3 polypeptide is required to completely deplete the CTB 
antibody pool of neutralizing antibodies. This experiment also shows that the neutralization 
epitope of lNT-3 can be expressed in alternative vector systems and the results are 
independent of the vector utilized or the accompanying fusion partner. 

Other Interval 3 specific proteins were subsequently tested for the ability to remove 
neutralizing antibodies within the CTB antibody pool as described above. The Interval 3 
specific proteins used in these studies are summarized in Figure 23. In Figure 23 the 
following abbreviations arc used: pi' refers to the pET23 vector: pM refers to the pMALc 
vector: B refers to toxin B: the numbers refer to the amino acid interval expressed in the 
clone, rhe solid black ovals represent the MBP: and HHH represcnt.s the poly-hisiidinc tag. 
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Only recombinant proteins comprising the entire toxin B repeat domain (pMB1750- 
2360, pPB 1750-2360 and pPB 1850-2360) can bind and completely remove neutralizing 
antibodies from the CTB antibody pool. Recombinant proteins comprising only a portion of 
the toxin B repeat domain were not capable of completely removing neutralizing antibodies 
5 from the CTB antibody pool (pMB1750.|970 and pMB 1970-2360 could partially remove 
neutralizing antibodies while pMB1850-lQ70 and pPB 1850-2070 failed to remove any 
neutralizing antibodies from the CTB antibody pool). 

The above results demonstrate that only the complete ligand binding domain (repeat 
region) of the toxin B gene can bind and complcicly remove neutralizing antibodies from ihc 
10 CTB antibody pool. These results demonstrate that antibodies directed against the entire toxin 
15 repeat region are necessary for in vivo toxin neutrah'zaiion (see Figure 23: only the 
recombinant proteins expressed by the pMBl 750-2360. pPB 1 750-2360 and pPB 1 850-2360 
vectors arc capable of completely removing the neutralizing antibodies from the C fB 
aniibod> pooh. 

I> These results represent the first indication that the entire repeat region of toxin B 

would be necessary for the generation of antibodies capable of neutralizing toxin B. and that 
sub-regions may not be sufficient to generate maximal titers of neutralizing antibodies. 

b) Identification Of Toxin B Sub-Region Specific Antibodies 

-0 That Neutralize Toxin B In Vivo 

l o determine if antibodies directed against the toxin B repeat region are sufficient for 
neutralization, region specific antibodies within the (' I B antibody preparation were affinity 
purified, and tested for in vivo neutralizxiiion. Affinity columns containing recombinant toxin 
B repeat proteins were made as described below. A separate affinity column was prepared 

25 using each of the following recombinant toxin B repeat proteins: pPB17.S0-2360. pPB1850- 
2360. pMB 1750- 1970 and pMB 1970-2360. 

lor each affinity column to be made, four ml of PBS-washcd Actigel resin (Sterogene) 
was coupled overnight at room temperature with 5-10 mg of affinity purified recombinant 
protein (first extensively dialyzed into PBS) in 15 ml tubes (Falcon) containing a 1/10 final 

30 \olume Aid-coupling solution (I M sodium cyanoborohydride), Aliquots of the supernaiants 
from the coupling reactions, before and after coupling, were assessed by Coomassie staining 
of 7.5% SDS-PA(iE gels. Based on protein band intensities, in all eases greater than 30% 
coupling efficiencies were estimated. The resins were poured into 10 ml columns (BioRad), 
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washed extensively with PBS. pre-eluted with 4M guanidine-HCI (in 10 mM Tris-HCi. pH 
8.0) and reequilibrated in PBS. The columns were stored at 4^C. 

Aliquots of a CTB IgY polyclonal antibody preparation (PEG prep) were affinity 
purified on each of the four columns as described below. The columns were hooked to a UV 
monitor (ISCO). washed with PBS and 40 ml aliquots of a 2X PEG prep (filter sterilized 
usiny a 0.45 m filter) were applied. The columns were washed with PBS until the baseline 
\aluc was re-established. The columns were then washed with BBStwecn to elute 
nonspecifically binding antibodies, and reequilibrated with PBS. Bound antibody was eluted 
from the column in 4M guanidine-HCI (in lOmM Tris-HCI. pH8.0). The eluted antibody was 
immediately dialyzed against a 100-fold excess of PBS at 4°C for 2 hrs. The samples were 
then dialyzed extensively against at least 2 changes of PBS. and affinity purified antibody was 
collected and stored at 4%'. The antibody preparations were quantified by UV absorbancc. 
The elution volumes were in the range of 4-8 ml. All affinity purified stocks contained 
similar total aniibtidy concentrations, ranging from 0.25-0.35% of the total protein applied to 
15 the columns. 

llic ability of the affinity purified antibody preparations to neutralize toxin B in vivo 
was determined using the assay outlined in a) above. Affinity purified antibody was diluted 
1:1 in PBS before testing. The results arc shown in Table 26. 

In all cases similar levels of toxin neutralization was observed, such that lethality was 
dela\cd in all groups relative to prcimmune controls. This result demonstrates that antibodies 
leaciivc to liic repeat region of the toxin B gene are sutlicient to neutralize toxin [3 //; vivo. 
The hamsters will ..•ventually die in all groups, but this death is maximally delayed with the 
(TH PIXi prep antibodies. Thus neutralization with the affinity purified (AP) antibodies is 
not as complete as that observed with the CTB prep before affinity chromatography. This 
result may be due to loss of activity during guanidine denaturation (during the elution of the 
antibodies from the affinity column) or the presence of antibodies specific to other regions of 
the toxin B gene that can contribute to toxin neutralization (present in the CTB PI-G prep). 



20 
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TABLE 26 

Nemralizacion Of Toxin B By Affiniiy Puritled Amibodies 



5 ' 



Treatment group* 


Number Animals Alive*' 


Number Animals Dead*" 


Preimmunc' 


0 


5 


CTn':400 Mg 


5 


0 


CTB (Al> on pPB 1750-2360):- 875 Mg 


s 


0 


CTB (AP on pMB 1750. 1070):- 875 y\g 


5 


0 


CTB (AP on pMB 1970-2360):- 500 yi» 


s 


0 



' <■ diffkile toxin B (CTB) (Tech Lab: at 5 Mg/nil. 25 y\» total) ni lethal coiKcniration lo 

hamsters is added to antibody and incubated for one hour ai 37"C. Atier incubation this 
mixture is Injected intraperitoneal ly (IP) into hamsters. Rach treatment group received loxin 
prcfTiixed with antibody raised against the indicated protein, as either: '4X antibody PEG prep 
or •aflmily purified (AP) iinlibody (from CTB PRC prep, on indicated columns). The amount 
' ^ 111 specific antibody in each prep is indicated; the amount is directly determined for affmity 

purified preps and is estimated for the 4X C TB as described in lixampic 15. 

The numbers in each iiroiip represent numbers of hamsters dead or alive. 2 hr post II* 
administration of toxin/antibody mixture. 

20 

I hc (ibservalion that antibodies afllniiy purified against the non-ovcrlappinii 
pMBI750-107() and pMBl 970-2360 proteins neutralized loxin W raised the possibility that 
either I) antibodies specifie to repeat sub-regions are sufTieienl to neutralize toxin B or 2) 
sub-region specific proteins can bind most or all repeat specific antibodies present in the CTB 

25 polyclonal pool. This would likely be due lo contormalional similarities between repeats. 

since homology in the primary amino acid sequences between different repeats is in the range 
of only 25-75% (Eichel-Strciber. et ai (1992) Molec. Cien. Cienetics 233:260|. I hesc 
|X)ssibilities were tested by affinity chromatography. 

The CTB PEG prep was seciucnlially depleted 2X on the pMBI750-1970 column: only 

30 a small elution peak was ob.served alter the second chromatography, indicating that most 
reactive antibodies were removed. This interval depleted CTB preparation was then 
chromalographed on the pPB 1 850-2360 column: no antibody bound to the column. The 
reactivity of the CTB and CTB (pMB!750-I970 depleted) preps to pPBI750-2360. pPB1850- 
2360. pMB1750-1970 and pMBI970-2360 proteins was then determined by 1{L1SA using the 

35 protocol described in Example 13(c). Briefiy. 96-well microtitcr plates (l alcon, Pro-Bind 
Assay Plates) were coaled with rect>mbinanl protein by adding 100 ^il volumes of protein at 
1-2 ^ig/ml in PBS containing 0.005% thimerosal lo each well and incubating overnight at 4°C. 
The next morning, the coaling suspensions were decanted and ihe wells were washed three 



- 124 - 



wo 98/08540 



PCT/US97/15394 



times using PBS. In order to block non-specific binding sites. 100 pi of 1.0% BSA (Sigma) 
in PBS (blocking solution) was then added to each well, and the plates were incubated for I 
hr. at 37°C. The blocking solution was decanted and duplicate samples of 150 fil of diluted 
antibody was added to the first well of a dilution series. The initial testing scrum dilution 
was (1/200 for CTB prep, (the concentration of depleted CTB was standardized by OD,«,) in 
blocking solution containing 0.5% Tween 20. followed by 5-fold serial dilutions into this 
solution. This was accomplished by serially transferring 30 mI aliquots to 120 jtl buffer, 
mi.xing. and repeating the dilution into a fresh well. After the final dilution. 30 ul wa.s 
removed from the well such that all wells contained 120 mI fmal volume. A total of 5 such 
dilutions were performed (4 wells total). The plates were incubated for 1 hr at 37°C. 
F'ollowing this incubation, the serially diluted samples were decanted and the wells were 
wushcd three times using PBS containing 0.5% Tween 20 (PBST). followed by two 5 min 
washes u.sing BBS-Tween and a final three washes using PB.ST. To each well. 100 nl of 
I . lOOO diluted secondary antibody (rabbit anli-chickcn IgCi alkaline phosphaia.se (Sigma) 
diluted in blocking solution containing 0.5% Tween 20 1 was added, and the plate was 
incubated I hr at 37»C. The conjugaic solutions were decanted and the plates were washed 6 
times in PB.ST. then once in 50 mM NaX'O,. 10 mM MgCU. pH 9.5. The plates were 
developed by the addition of 100 mI of a solution containing I nig/ml para-nitro phenyl 
phcLsphate (.Sigma) dis.solved in 50 mM NaX'O,. 10 mM MgCI,. pH9.5 lo each well. I hc 
plates wore then incubated at room temperature in the dark for 5-45 min. The absorbency of 
each well was measured at 410 nni using a Dynatech MR 700 plate reader. 

As predicted by the affinity chromatography results, depletion of the CTB prep on the 
pMBI 750-1970 column removed all detectable reactivity lo the pMBI970-2360 protein. The 
reeiproea! purification of a C'l B prep that was depleted on the pMB1970-2360 column 
yielded no bound antibody when chromatographed on the pMBI750-l970 column. These 
results demonstrate that all repeat reactive antibodies in the CTB polyclonal pool recognize a 
conserved striieiure that is prcsem in non-overlapping repeats. Although it Is po.ssible tliat this 
conserveti structure represents rare conserved linear epitopes, it appears more likely that the 
neutralizing antibodies recognize a speeille protein conformation. This conclusion was aLso 
suggested by ihc results of Western blot hybridization analysis of CTB reactivit\ to these 
recombinant proteins. 

Western blots of 7.5% SDS-PAOE gels, loaded and electrophoresed with defined 
quantities of each recombinant protein, were probed with the CTB polyclonal antibody 
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preparation. The blots were prepared and developed with alkaline phosphatase as described in 
Example 3. The results are shown in Figure 24, 

Figure 24 depicts a comparison of immunoreacliviiy of IgY antibody raised against 
either native or recombinant toxin B antigen. Rqual amounts of pMB 1750- 1^70 (lane 1 ), 
pMB1970-2360 (lane 2). pPBI850-2360 (lane 3) as well as a serial dilution of pPBl 750-2360 
(lanes 4.(> comprising IX. 1/1 OX and 1/lOOX amounts, respectively) proteins were loaded in 
duplicate and resolved on a 7.5% SDS-PAGH gel. The gel was blotted and each half was 
hybridi7.ed with PEG prep IgY antibodies from chickens immunized with cither native CTB or 
pPBl 750-2360 protein. Note that the full-length pMB1750-l970 protein was identified only 
by antibodies reactive to the recombinant protein (arrows). 

Although the CTB prep reacts with the pPBl 750-2360. pPB 1850-2360. and plVIB1970- 
2360 proteins, no reactivity to the pMB1750-l970 protein was observed (I'igurc 24). (iiven 
that all repeat reactive antibodies can be bound by this protein during affinity 
chromatography, this result indicates that the protein cannot fold properly on Western blots. 
Since this eliminates all antibody reactivity, it is unlikely that the repeat reactive antibodies in 
the CTB prep recognize linear epitopes. This may indicate that in order to induce protective 
antibodies, recombinant toxin B protein will need to be properly folded. 

c) (icncrntion And Evaluation Of Antibodies Reactive To 
Recombinant Toxin B Polypeptides 
i) Generation Of Antibodies Reactive To 
Recombinant Toxin B Proteins 

Antibodies against recombinant proteins were generated in egg laying Leghorn hens as 
described in li.xample 13. Antibodies were raised (using Freunds adjuvant (Gibco) unless 
otherwise indicated] against the following recombinant proteins; I) a mixture of Interval 1+2 
proteins (see Figure 18); 2) a mixture of interval 4 and 5 proteins (see Figure 18): 3) 
pMBI970-2360 protein: 4) pPB175()-2360 protein: 5) pMB 1750-2360: 6) pMB175O-2360 
n'iiermax adjuvant (Vaxcell)]: 7) pMBI750-2360 |Gerbu adjuvant (Biotech)|: 8) pMBpl750- 
2360 protein: 9) pPBI 850-2360: and 10) pMB1850.2360. 

Chickens were boosled at least 3 times with recombinant protein until FLISA 
reactivity (using the protocol described in b) above with the exception that the plates were 
coated with pPB 1750-2360 protein] of polyclonal PHO preps was at least equal to that of the 
CTB polyclonal antibody PEG prep. ELISA titers were determined for the PECJ preps from 
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all of ihe above immunogens and were found to be comparable ranging from 1:12500 to 
1:62500. High titers were achieved in all cases except in 6) pMBl 750-2360 in which strong 
titers were not observed using the Titermax adjuvant, and this preparation was not tested 
further. 



ii) Kvaiuation Of Antibodies Rcacti%'c To 
Recombinant Proteins By Western Blot 
Hybridization 

Western blots of 7.5% SDS-PACiLi gels, loaded and dectrophorcsed with defined 
quantities of recombinant protein (pMB1750-l97a pPBl850-2360. and pMBl<)70.2360 
proteins and a serial dilution of the pPB 1 750-2360 to allow quantiticalion of reactivity), were 
probed with the CTB. pPB 1750-2360. pMB 1 750-2360 and pMB 1970-2360 polyclonal 
antibody preparations (from chickens immunized using Frcunds adjuvant). The blots were 
prepared and developed with alkaline phosphatase as described above in b). 

As shown in Figure 24. the C'FB and pMB 1 970-2360 preps reacted strongly with the 
pPB I 750-2360. pPB1850-2360. and pVIB1970-2360 proteins while the pPBI750-2360 and 
pMB 1 970-2360 {(lerbu) preparations reacted strongly with all four proteins. The Western 
blot reactivity nf the pPBI750-2360 and pMBI970-2360 (Gerbu) preparations were equivalent 
to that oi' the CTIi preparation, while reactivity of the pMB 1 970-2360 preparation was <10% 
that oj' the CTB prep. Despite equivalent ELISA reactivities only weak reactivity 
(approximately |%) to the recombinant proteins were observed in PEG preps from two 
independent groups immunized with the pMB 1750-2360 protein and one group immunized 
with the pMB I 750-2360 preparation using Freunds adjuvant. 

Affinity purification was utilized to determine if this dificrence in immunoreaciiviiy by 
Western blot analysis retkcts differing antibody tilers. Fifty ml 2X PEG preparations from 
chickens immunized with either pMB 1750-2360 or pMB 1 970-2360 protein were 
chromaiographed on the pPBl 750-2360 affinity column from b) above, as described. The 
yield ol afliniiy purified antibody (% total protein in preparation) was equivalent to the yield 
obtained from a CTB PEG preparation in b) above. Thus, differences in Western reactivity 
rellect a qualitative difference in the antibody pools, rather than c|uantifative differences.. 
Ihese results demonstrate that certain recombinant proteins are more effective at generating 
high affinity antibodies (as assayed by Western blot hybridization). 
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iii) In Vivo Neutralization Of Toxin B Using 

Antibodies Reactive To Recombinant Protein 

The w vivo hamster model [described in Examples 9 and 14(b)] was utilized lo assess 
the neutralizing ability of antibodies raised against recombinant toxin B proteins. The results 
IVoni three experiments are shown below in Tables 27-29. 

The abiliiy of each immunogen to neutralize toxin B in vivo has been compiled and is 
shown in lablc 30. As predicted from the recombinant proiein-CTB premix studies (Table 
24) only antibodies to Interval 3 (1750-2366) and not the other regions of toxin B (lc. 
intervals 1-5) are protective. Unexpectedly, antibodies generated to 1NT.3 region expressed ir 
pMAL vector (pMB 1750-2360 and pMpB 1750-2360) using Freunds adjuvant were non- 
neutralizing. This observation is reproducible, since no neutralization was observed in two 
independent immunizations with pMB 1 750-2360 and one immunization with pMpBI750. 
2360. I hc fact that 5X quantities of affinity purified toxin B repeat specilk antibodies from 
pMB 17.^0-2360 PIX] preps cannot neutralize toxin B while IX quantities of affinity purified 
15 anii-CTB antibodies can (Table 28) demonstrates that the differential abiliiy of CTB 

antibodies to neutralize toxin B is due to qualitative rather than quantitative differences in 
these antibody preparations. Only when this region was expressed in an alternative vector 
(pPBl 750-2360} or using an alternative adjuvant with the pMBl 750-2360 protein were 
neutralizing antibodies generated. Importantly, antibodies raised using I reunds adjuvant to 
20 pPB 1850-2360. which contains a fragment that is only 100 amiiu) acids smaller than 

recombinant pPBl 750-2360. are unable to neutralize toxin B in vivo (Table 27): note alsi^ that 
the same vector is used for both pPB 1850-2360 and pPB 1 750-2360. 
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Treatment Group'' 



Prcimmune 



CTB 



INTK2 



TABLE 27 

In Vivo Neutralization Of Toxin B 



Number Animals Alive'' 



0 



Number Animals Dead** 



INT4^S 



pMB 1 750-2360 



pMB 1970-2360 



pPB 1 750-2360 



(. Me toxm B (CTD) (at 5 Mu/nii: 25 total: Tech Lab) at lethal concentration to 
hamsters ,s added to antibody and .ncubated for one hour at After incubation li s 

rmlv^n' "y,"^^^ '•";'^P^^^««"^^»'y (iP) into hamsters. F.ach treatment group received toxin 
prem.xed with antibody raised against the indicated protein, as a 4X antibody PEG prep 

The numbers in each group represent numbers o!' hamsters dead or alive. 2 hours post IP 
administration of toxin/antibody mixture, 

TABLE 28 



Treatment Ciroup" 


Number Animals Alive'^ 


J ^\nunoaies 

Number Animals Dead" 


Preimmune( 1 ) 


0 


5 


CTB(I) 


5 


0 


pPB 1750-2360(1) 


5 


0 


1.5 mi! anti-pMBI750-2360f2) 


1 


■1 


1.5 mg ami-pMB 1970-2360(2) 


0 


> 


300 Mg anti-CTBf2) 


> 



uni«crs .s added to .-uiubody and incubated for one hour ai :,7»C. After incubation. I ml ol 
h.s mixture .s ,n,cctcd .ntraperitonealiy (IP) into hamsters. Each treatment eroup received 
lox.n prem.xed with antibody raised against the indicated protein, as either ( I ) 4X a.«ibodv 
7: "1!?.^:^-^ P""'"'^'' "'""""'y ^ PPB'75"-2360 r«si„,. either 1.5 JZol 

T'^C^'^ -cliluted amnity puriHed natiJc^ltu::..!. 

SJioup (anti-CTB. repeat specilic: used 1/5 diluted anti-CTB antibody). 

The numbers in each group represent numbers of hamsters dead or alive. 2 hr nosi-IP 
itdministration of toxin/antibodv nii.\ture. 



- 129 - 



WO98A)8540 



PCTAJS97/15394 



TABLE 29 



Genermion Of Neutralizing Antibodies IJiilizinu The Gcrbu Adjuviint 



Treatment Group' 


Number Animals Alive'* 


Number Animals Dead* 


Preimmunc 


0 


5 


CTB 


5 


0 


pMB 1970-2360 


0 


5 


pMB 1850-2360 


0 


5 


pPB 1850-2360 


0 


5 


pMB 1 750-2360 (Gcrbu adjl 


5 


0 



(' Jif/ki/c lax in B (CTB) (Tech Lab) ai lethal conccntraiion to lianisicrs is added lo antibodx 
;nid incubated Ibr one hour at 37**C. After incubation this mixture is injected iniraperitonealiy 
(IP) into hamsters, liach treatment group received toxin preniixed with antibody raised uiiainsi 
the indicated protein, as a 4X antibody PEG prep. 

riic numbers in each uroup represent numbers ot hunisiers deati or ali\e. 2lirs post IP 
administration of toxin antibody mixture. 
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TABLE 30 



llllfTlunOL'en 


Adjuvant 


1 

Tested Preparation'' 


Anti2en 
Utilized For AP 


/yj vivo 
Neutralization'* 


Preiminunc 


NA' 


PEG 


NA 


no 


CTB (native) 


Tiiermax 


PEG 


NA 


yes 


CTH (n«Hivcj 


f iiemiax 


AP 


pPB 1 750-2360 


yes 


CTB (niuive) 


niermax 


AP 


pPB 18.50-2360 


yes 


CTB (iiaiive) 


Tiiermax 


AP 


pPB 1750- 1970 


yes 


CTB (naiivct 


Tiiermax 


AP 


pPB 1970-2360 


yes 


pMB 1750-2360 


Freunds 


PEG 


NA 


no 


pMB 1750-2360 


Frcunds 


AP 


pPB 1 750-2360 


no 


pMB 1750-2360 


Ci crbi! 


PEG 


NA 


yes 


pMB 1970-2360 


Freund.s 


PEG 


NA 


no 


pMBM)70-2360 


Frcunds 


AP 


pPB 1750-2360 


no 


pPB 1 750-23^)0 


Freunds 


PEG 


NA 


yes 


pI»BI«5()-2360 


I-reunds 


PEG 


NA 


no 


pMB 1850-2360 


Frciinds 


PEG 


NA 


no 


INT 1 2 


Freunds 


PEG 


NA 


no 


IN'T4 5 


Ireunds 


PEG 


NA 


no 



I-jthcr PEG preparation (PEG) or alKlnity purified aiuibodies (AP). 

Yes- denotes complete neutralization (0-5 dead) while 'no* denotes no neutralization (S ^ dead) 
t>i tovin B. 2 hours post -administration of mixture. 

*N.'\* denotes not applicable. 

Tlic pPB 1 750-2360 antibody pool confers signiflcaiil in vivo protection, equivalent to 
ihat obtained with the affinity purified CTB antibodies. This correlate.s with the oKservcd high 
affinitx of this antibody p«)ol (relative to the pMBI750-2.-?60 t»r pMB1970.236() pools) as 
a.s.sayed by Western blot analysis (Figure 24). These results provide the first demon.stration 
that in vivo neutralizing antibodies can be induced using recombinant toxin B protein as 
tmnuinogcn. 

I he failure ol high concentrations of antibodies rai.sed against the pMB 1 750-2360 
protein (using Freunds adjuvant) to neutralize, while the u.se of Cierbu adjuvant and 
pMBl 750-2360 protein generates a neutralizing response, demonstrates that ct»nformation or 
presentation of this protein is e.s.sential for the induction of neutralizing antib(,dies. The.se 
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results are consistenl with the obser\'aiion that the neutralizing antibodies produced when 
native CTB is used as an immunogen appear to recognize conrormalional epitopes [see section 
b) above]. This is the first demonstration that the conformation or presentation of 
rcconihinanl toxin B protein is essential to generate high liters of neutralizing antibodies. 

EXAMPLE 20 

Determination Of Quantitative And Qualitative 
DitTerences Beiv^cen pMBI750-236(K pMB 1 750-2360 (Cicrbu) 
Or pPB 1 750-2360 IgY Polyclonal Antibod\ ['reparations 

In L'xample 19. it was demonslraicd thai toxin B neutralizing antibociics could he 
generated using specific recombinant toxin B proteins (pPB 1750-2360) or specillc adjuvants. 
Antibodies raised against pMB 1750-2360 were capable of neutralizing the entcrotoxin elTect 
of toxin M when the recombinant protein was used to immunize hens in conjunciion with the 
(icrbu adjuvant, but not when Freunds adjuvant was used. To determine the basis for these 
antigen and adjuNani restrictions, toxin B-specific antibodies present in the neutralizing and 
non-neutralizing PILG preparations were isolated by aflinity chromatography and tested for 
qualitative or quanliialive differences. The example involved a) purification of ami-toxin B 
specillc antibodies iVom pMB 1750-2360 and pPB 1 750-2360 Pl-G preparations and b) in vivo 
neutralization of toxin B using the affmity purified antibody. 

a) Purification Of specific Antibodies From piVim75n-2360 And 
pPBI 750-2360 PFX; Preparations 

To purify and determine the concentration of specific antibodies (expressed as the 
percent of total antibody) within the pPBI 750-2360 (Freunds and Gerbu) and pPBl 750-2360 
PEG preparations, defined quantities of these antibody preparations were chromatographed on 
an affinity column containing the entire toxin B repeat region (pPB 1 750-2360). The amount 
of alTinily purified antibody was then quantified. 

An affinity column containing the recombinant toxin B repeat protein. pPB 1 750-2.160. 
was made as follows, l-our ml of PBS-washed Acligel resin (Sierogene) was coupled with 5 
mg of pPB 1750-2360 affinity purified protein (dialyzed into PBS: estimated to be greater than 
05% full length fusion protein) in a 13 ml tube (Falcon) containing I/IO final volume Aid- 
coupling solution (IM sodium cyanoborohydride). Aliquots of the supernatant from the 
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couplini! reactions, before and after coupling, were assessed by Coomassie staining of 7.5% 
SDS-PAGE gels. Based on protein band iniensilies, greater than 95% (approximately 5 mg) 
of recombinant protein was coupled to the resin. The coupled resin was poured into a 10 ml 
column (BioRadl. washed extensively with PBS. pre-eluicd with 4M guanidine-HCI (in 10 
inM Tris-MCl. pM 8.0: 0,005% thimerosal) and re-equilibrated in PBS and stored at 4X. 

Aliquots of pMB 1750-2360. pMB1750-2360 (Gerbu) or pi>B1750-2360 IgY polyclonal 
antibody preparations (PEG preps) were affinity puritled on the above column as follows. 
The column was attached to an VV monitor (ISCO). and washed with PBS. l oriy ml aliquots 
of 2X PEG preps (filler sterilized using a 0.45 m lilier and quantilled by OI),,,, before 
chromatography) was applied. The column was washed with PRS until the baseline was re- 
established (the column How-through was saved), washed with BBSTween to cluic 
nonspccilically binding antibodies and re-equilibraied with PBS, Fiound antibody was cluied 
from the column in 4M guanidine-HCI (in 10 mM Tris-HCL. pH 8.0. 0,005% ihimerosal) and 
the entire clution peak collected in a 13 ml tube (Falcon). The column was rc-cquilibraicd. 
and the column cluatc re-chromaiographed as described above. The antibody preparations 
were c|uantilled by I IV absorbance (the eluiion buffer was u.scd lo zero the 
.spectrophotometer). Approximately !0 fold higher conceniraiions of total purified antibody 
was obtained upon cluiion of the lirst chromatography pass relative tt) the .second pass. The 
low yield from the second chromatography pass indicated that mo.st of ihc specillc antibodies 
were removed by the llrsl round o I' chromatography. 

Pools of affinily purified specillc antibodies were prepared by dialysis of the column 
clutes uficr ihe first column chromatography pass for the pMB 1 750-2360. pMBI 750-2360 
((icrbu) or pPB1750-2360 IgY polyclonal antibody preparations. The eluies were collected 
on ice and immediately dialy/.ed against a 100-fold volume of PBS at 4°C for 2 hrs. The 
samples were then dialyzed against 3 changes of a 65-fold volume of PBS at 4°C. Dialysis 
was performed for a minimum of 8 hrs per change of PBS. The dialyzed samples were 
collected, centrifuged to remove insoluble debris, quantilled by ()D.,„. and .stored at 4°C. 

The percentage of toxin B repeat-specific antibodies present in each preparation was 
determined using the quantifications of antibody yields from the first column pass (amount of 
specific antibody recovered after first pass/total protein loaded), fhe yield of repeat-specific 
affinity purified antibody (expressed as the percent of total protein in the preparation) in: I) 
the pMBl 750.2360 PEG prep was approximately 0.5%. 2) the pMB 1750-2360 (Cierbu) prep 
was approximately 2,5%. and 3) the pPBI 750-2360 prep was approximately 0.4%. 
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I'uriticaiion of a CTB IgY polyclonal antibody preparaiion on the same column demonsirated 
that the concentration of" toxin B repeat specific antibodies in the CTB preparaiion was 0.35%. 

These results demonstrate that 1 ) the use of Oerbu adjuvant enhanced the liter of 
speciHc antibody produced against the pMBl 750-2360 protein 5-fold relative to immunization 
using Freunds adjuvant, and 2) the differences seen in the in vivo neutralization ability of the 
PMB1750.2360 (not neutralizing) and pPB1750.2360 (neutralizing) and CTB (neutralizing) 
Pl-(j prep.s seen in l-xample was not due to differences in the tiicrs of repcat-spccillc 
antibodies in the three preparations because the titer nj' repeai-specilk antibody was similar 
for all three preps: therefore the differing ability of the three antibody preparations to 
neutralize toxin B must rellcci qualitative differences m the induced loxin B repeat-specific 
antibodies. To confirm that qualitative differences exist between antibodies raised in hens 
immunized with different recombinant proteins and/or different adjuvanis. the same amount of 
affinity puritled anti-toxin B repeal (aa 1870-2360 of loxin B) antibodies from the different 
preparations was administered to hamsters using the in vivo hamster model as described 
belovs. 



1)) /// I'fiY^ Neutralization Of Toxin B lising Affinity Purified 
Antibody 

I he in vivo hamster model was uiilized to assess ihe neutralizing ability of the affinity 
puritled antibodies raised against recombinant toxin B proteins purified in (a) above. As well, 
a 4X IgY PEG preparaiion from a second independent immunization utilizing the pPBI75()- 
2360 antigen with l-reunds adjuvant was tested for in vivo neutralization. The results are 
shown in fable 31. 

fhe results shown in Table 31 demonstrate that: 
I) as shown in Example 19 and reproduced here. 1.5 mg of affinity purified 

antibody from pMBI 750-2360 immunized hens using Freunds adjuvant does 
noi neutralize loxin B in vivo. However. 300 Mg of alfinit\ purified antibody 
from similarly immunized hens utilizing (Jerbu adjuvant demonstrated complete 
neutralization of loxin B in vivo. This demonstrates that Cierbu adjuvant, in 
addition to enhancing the liter of antibodies reactive to ihe pMB 1 750-2360 
antigen relative to l-reunds adjuvant (demonstrated in (a) above), also enhances 
the yield of neutralizing antibodies to this antigen, greater than 5 fold. 
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Complete in vivo neutralization of toxin B was obsen'ed witii 1 .5 nig of 
affinity purified antibody from liens immunized with pPB 1 750-2360 antigen, 
but not with pMB 1750-2360 antigen, when Frcund.s adjuvant was used. This 
demonstrates, using standardized toxin B repeat-.spccillc antibody 
concentrations, that neutralizing antibodies were induced when pPB 1 750-2360 
but not pMB 1750-2360 was used as the antigen with Frcunds adjuvant. 
Complete in vivo neutralization was observed with 300 Mg of pMBl 750-2360 
(Oerbu) antibody, but not with 300 Mg of pPBl 750-2360 (I-rcunds) antibody. 
Thus the pMBI 750-2360 ((icrbu) antibody lias a higlicr liter of neutralizing 
antibodies than the pPBI 750-2360 (Frcunds) antibody. 
Complete neutralization of toxin tt was observed using 300 f.ig ol C I B 
antibody (affinity purified (AP)| but not 100 Mg CTB antibody (AP oi PF.G 
prep). This demonstrates that greater than 100 ^ly of toxin B repeat-specific 
antibody (aiui-Cl B) is necessary to neutralize 25 (.ig toxin B in vivn in this 
a.ssa>. and that aftinity purified antibodies specific to the loxin B repeat interval 
neutralize toxin 13 as clfcctiNely as ihe PEP prep of IgY raised against the 
entire CTB protein (siiovvn in this assay). 

As uas observed with the initial pPBI 750-2360 (IgY) PL(i preparation 
(l.xample l<)). complete ncutraii/jiion was observed with a IgY PKG 
preparation i.solated from a second independent group of pPB 1 750-2360 
(Frcunds) immunized hens. This demonstrates that neutralizing antibodies are 
reproducibly produced when hens are immunized with pPB 1 750-2360 protein 
utilizing Freunds adjuvant. 
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TABLE 31 



In vivo Neutralization Of Toxin B Using Aftlniiy Piiritled Antibodies 



Treatment Group' 


Number Animals Alive" 


Number Animals Dead" 


Preimmunc' 


0 


5 


CTB (300 ng)- 


5 


0 


CTB (100 y\^y 


1 


4 


pMU 1750-2360 (G) (5 nm)- 


5 


0 


pMB 1750-2360 «i) (1.5 mg)' 


5 


0 


pMB 1750-2360 (G) (300 ^ig)- 


5 


0 


pMB 1750-2360 (F) (1.5 nig)* 


0 


5 


pPB 1750-2360 (D (1.5 ingr 


5 


0 


pPB 1750-2360 ((-) (300 Mg)" 




4 


CTB (100 yxvL) 






pPU 1 750-2360 (F) (500 ay) 


5 


0 



( i/i/fidh- io\in R (CTB) (Tech Lab) ai lethal conccniraiion li) hamsters (25 MJi) was added li> 
the aniibody {aiiiouni uF specific aiuibody is indicated) and incubated lor cuie hour at 37^C. 
Alter incubation, this mixture was injecicd IP into hamsters (I '5 total mix iniecied per 
hamstcn. Fach treatment group received toxin premixed with aniib<>d\ raised ai:ninst the 
iiulicaied protein ((j uerbu adjuvant. F^Freunds adjuvam). ' indicates the anlibod) was a 4X 
IgY PCd piep; - imlieates the aniibody was altmiiy purilled on a pPBI 850-23()() resm: and 
indicates that the aniibody was a IX IgY PflG prep. 

1 he numbers in each group represent numbers of hamsters dead or alive. 2 hrs post IP 
administration of loxin aniibody mixture. 

KXAMPLE 21 

Diagnoslic liiuynic Ininiunoassays lor C \ ilif/icilc Toxins A And B 

I'hc abilitN ol' ihc recombinant loxin proteins and antibodies raised Uifainsi these 
recombinant proteins (described in the above examples) to form the basis ofdiagnoslic assays 
for the detection of clostridial loxin in a sample was examined. Two ininuinoassay lormais 
were tested to ciuaniilaiively delect ('. clij/idic loxin A and loxin li from a hioloyical 
specimen. The llrst lormai involved a compeiiii\c assay in which a llxed amount of 
recombinani loxin A or B was immobilized on a solid support (c..*:., microliter piaic wells) 
followed by the addition of a loxin-conlaining biological specimen mixed with affinity- 
purified or PHG fraclionaled antibodies against recombinani loxin A or l\. If loxin is present 
in a specimen, this loxin will compete with ihe immobilized recombinani loxin protein for 
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binding lo the anti-recombinant antibody liiereby reducing the signal obtained following the 
addition of a reporter reagent. The reporter reagent detects the presence of antibody bound to 
the immobilized toxin protein. 

In the second format, a sandwich immunoas.say was developed using affinity-purified 
antibodies to recombinant toxin A and B. The amnity-purificd antibodies to recombinant 
toxin A and B were used to coal microliter wells instead of ihe recombinant polypeptides (as 
was done in the competitive assay format). Biological samples containing toxin A or B were 
then added to the wells followed by the addition of a reporter reagent to delect the presence 
of bound toxin in the well. 

a) Cumpctitivc iminunoa.ssay For The Detection Of C difficile 
Toxin 

Recombinant toxin A or B was attached to a solid support by coating 96 well 
microliter plates with the toxin protein at a concentraiion of I^Li/ml in PBS. The plaies were 
incubated overnight at 2-8«'C. The following morning, the coating .solutions ucrc removed 
and Ihc remaining protein binding sites on the wells were blocked by filling each well with a 
l>BS .solution containing 0.5% BSA and 0.05% Tvvcen-20. Native ( Jiftkilv toxin A or B 
(Tech f.ab) was diluted u, 4 Mg/ml in .stool extracts from healthy Syrian hamsters (Sasco). 
The stool extracts \vere made by placing fecal pellets in a 15 ml centrifuge tube: PBS was 
added at 2 nil/pcllei a.id the tube was voriexcd to create a uniform suspension. The tube was 
then ccnirifugcd ai 2000 rpm lor 5 min at room temperature. I he .supernatant was removed: 
ihis emprises ihe stool extract. I- ifty ^1 of the hamster stool extract was pipetted into each 
well of the microliter plates to serve as the diluent for serial dilutions of the 4 Mg/ml toxin 
•samples. One hundred nl of the toxin .samples at 4 ^ig/ml was pipetted into the first row of 
wells in the niicrotiier plate, and 50 m1 aiiquots were removed and diluted serially down the 
plate in duplicate. An equal volume of affinity purified anti-recombinant toxin antibodies (1 
ng/vvell of anti-pMA 1870-2680 antibody was used for the detection of toxin A: 0.5 ng/well of 
anti-pMBI750-2360((ierbu) was used for the detection of toxin B] were added to appropriate 
wells, and the plates were incubated at room temperature for 2 hours with gentle agitation. 
Wells .serving as negative control contained antibody but no native toxin to compete for 
binding. 

Unbound toxin and antibody were removed by washing the plates 3 to 5 times with 
PBS containing 0.05% Tween-20. Following the wa.sh step. 100 j.l of rabbit anti-chicken IgG 
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antibody conjugated to alkaline phosphatase (Sigma) was added to each well and the plates 
were incubated for 2 hours at room temperature. The plates were then washed as before to 
remove unbound secondary antibody. F-reshly prepared iilkalinc phosphatase substraie { I 
mg/ml p-niirophenyl phosphate (Sigma) in 50 mM Na.CC).. pl-l 9.5: 10 mM MgCiO was 
added to each well. Once sufficient color developed, the plates were read on a Dynaiech 
MR70() microtiter plate reader using a 410 nm filter. 

The results are summarized in Tables 32 and 33. |-or ihc results shown in Tabic 32. 
the wells were coated with recombinant toxin A protein (pM A 1870-2680). The amount of 
native toxin A added (present as an addition to M)lubilizcd hamster stool) to a given well is 
indicated (0 to 200 ng). Antibody raised against the recombinant toxin A protein. pMAI870- 
2680. was affinity purified on the an affinity column comaininti pPA 1870-2680 (described in 
l-xample 20). As shown in Table 32. the recombinant toxin A protein and airmity-purified 
antitoxin can be used for the basis of a competitive immunoassay for the detection of toxin A 
in biological samples. 

Similar results were obtained using the recombinant toxin II. pPB I 750-2360. and 
antibodies raised against pMBl 750-2360((ierbu). For the results shown in fable 33. the 
wells were coated with recombinant toxin B protein (pPBl 750-2360). fhc amount of native 
toxin B added (present as an addition to .solubilizcd hamster stool) to a given well is indicated 
(0 10 200 ng). Antibody raised against the recombinant toxin B protein. pMBI750- 
236()((ierbu). was affinity purified on the an affinity column containing pl*B 1 850-2360 
(described in Lxample 20). As shown in fable 33. the recombinant toxin B protein and 
affinity-purified antitoxin can he used for the ba.sis of a competitive immunoassay for the 
detection of toxin U in biological .samples. 

In this competition assay, the reduction is considered significant over the background 
levels at all points: therefore the assay can be used to detect .samples containing less than 12,5 
ng toxin A/well and as liule as 50-100 ng toxin B/well. 
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TABLE 32 

Compeiiiive Inhibition Of Anii>C. tiifftdk Toxin A Ry Native Toxin A 



Toxin A/Well 


0D,,„ Readout 


200 


0.176 


100 


0.253 


50 


0.240 


25 


0.259 


12.5 


0.309 


6.25 


0.367 


3.125 


0.417 


0 


0.590 


TABLE 33 

Compciiiivu Inliihiiion Of Anti-C. Ji/fid/c Toxin B Bv Naiivc Toxin 


iiLi Toxin n/Well 


0D„., Rcadoul 


200 


0.392 


100 


0.566 


50 


0,607 


25 


0.778 


12.? 


0970 


6.25 


0.^)02 


.VI25 


1.040 


0 


1.055 



These competitive iniiibition assays demonsnrale that native C. diffidic toxins and 
reeombinani (' Jifftdlc loxin proteins can compete for binding to antibodies raised against 
recombinant ( difficik' toxins demonstrating thai these anii-recombinant toxin antibodies 
provide ciTeciive diagnostic reagents. 

b) Sandwich Immunoassay For The Detection Of C difficiie 
Toxin 

/Xrilniiy-piirified antibodies against recombinant loxin A or toxin B were immobili/ed 
to 96 well microliter plaies as follows. The wells were passively coaled overnight at 4**C' 
with aninity purified aniibodies raised against either pMA 1 870-2680 (toxin A) or pMBlTSO- 
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2360(Gcrbu) (loxin B). The antibodies were affinity purified as described in Example 20. 
Tlie antibodies were used at a concentration of I ng/ml and 100 mI was added to each 
microliter well. The wells were then blocked with 200 nl of 0.5% BSA in PBS for 2 hours 
at room temperature and the blocking solution was then decanted. Stool samples from healthy 
Syrian hamsters were rcsuspended in PBS, pll 7.4 (2 ml I'BS/stool pelloi was used lo 
resuspend the pellets and the sample was ccmriliigcd as described above). The stool 
suspension was then spiked with native C. Jiflkih to.xin A or B ( Tech Lab) at 4 ng/ml. The 
stool susijensions containing toxin (either toxin A or toxin B) were then serially diluted two- 
fold in stool suspension without toxin and 50 mI was added in duplicate to the coated 
microliter wells. Wells containing stool suspension without toxin served as the negative 
comrol. 

The plates were incubated for 2 hours at room temperature and then were washed 
ihrce limes with I'BS. One hundred mI of dther goat anti-naiivc toxin A or i-oai anti-native 
loxin B (Tech Lab) diluted L I 000 in PBS containing 1% BSA and 0.05% Twecn 20 was 
added to each well . The plaies were incubated for another 2 hours at room temperature. 
I he plates vxcre tiicn washed as before and 100 ^il of alkaline pliosphata.sc-c(>njugatcd rabbit 
anti-goat lg(i K appel. Durham. N.C.) was added at a dilution of 1:1000. I he plaies were 
incubaicd for another 2 hours at room temperature. The plates were washed as before then 
developed by the addition of 100 ^I/well of a substrate solution containing 1 mg/nil p- 
nitrophcnyl phosphate (Sigma) in 50 mM NaXO,. pH 0.5: 10 mM MgCI,. The absorbanee of 
each ucll was measured using a plale reader (l)ynaiech) at 410 nm. The as.say results are 
shown in fables 34 and 35. 

TABLE 34 



ng Toxin A/Wcll 


. .^i.ii.j, 1 XL'S III 

OOj,„ Rcadoui 


200 


0.9 


100 


0.8 


50 


0.73 


25 


0.71 


12.5 


o.5y 


6.25 


0.421 


0 


0 
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TABLE 35 



(' c/if/ici/c Toxin B Deteciion In Stool Using Amnitv-Puritled Antibodies Aiiainsi Toxin B 


ng Toxin B/Well 


C)D|,„ Readout 


200 


1.2 


100 


0.975 


50 


0.887 


25 


0.846 


12.5 


0.65 1 


6.25 


0.451 


0 


0.004 



The re.sulis .shown in Tables .14 and 35 show thai antilxidics rai.secl against recombinant 
it).Nin ,\ and to.xin B Iraumenis can be u.sed to delect the presence olT. Ji/ficilc toxin in stiwl 
samples. I hcse antibodies lorm the basis lor a sensitive sandwich immunoas.say which is 
capable ol detectini. as litllc as 6.25 ng of either loxin A or B in a 50 ^il stool sample. As 
shown abiiNc in Tables 34 and 35. the background for this sandwich immunoassay is 
extremely low : ihercfore. the sensiti\ ity of this assay is much lower than 6.25 ng loxinAvell. 
It is likely that toxin levels of 0.5 to 1.0 pg/wcll could be detected by this assay. 

The results shown above in Tables 32-35 demonstrate clear utility of the recombinam 
reagents in ( '. iliffiviU> toxin detection systems. 

EXAMPLE 22 

C onstruction And I-,\pre.ssion Of ('. hoiuUnum C Fragment l usion Proteins 

Hie ('. hoiiiiimim type A neurotoxin gene has been cloned and sequenced [Thompson, 
el ul.. lim. J. Biochem. 189:73 (1990)]. The nucleotide sequence of the toxin gene is 
available from the EMBL/GenBank .sequence data banks under the accession number X52066; 
the nucleotide sequence of the coding region i.s listed in SlIQ ID N0.27. I he amino acid 
sequence ol'the boiulimmi type A neurotoxin is listed in .SliQ ID N():28. Ihe type A 
neuroto.xin gene is synthesized as a single polypeptide chain which is proces.sed to form a 
dimei- compo.sed of a light and a lieavy chain linked via di.sulfide bonds, ( he 50 kD carboxy- 
terminal portion of the heavy chain is referred to as the C fragment or the 1 1,, domain. 

Previous attempts by others to express polypeptides comprising the C fragment of C.'. 
homlimim type A toxin as a native polypeptide W.};.. not as a fusion protein) in E. coli have 
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been unsuccessful [H.F. LaPenotiere. ei aL in Boiutimm and Tetanus Neuroioxins. DasGupta. 
Ed.. Plenum Press. New York (1993). pp. 463-466]. I-xpression of the C fragment as a 
fusion wiih the £. coli MBP was reporied to result in the production of insoluble protein 
(H.F. LaPenotiere. et al,. supra). 

In order to produce soluble recombinant C fragment proteins in E. coli. fusion proteins 
comprising a synthetic C fragment gene derived from liie C. hotidinum type A toxin and 
either a portion of the C ilifficila toxin protein or the MBP were constructed. This example 
involved a) the construction of plasmids encoding C iVagment lusion proteins and b) 
expression of ( ' hondinum C IVagnienl fusion proteins in /:. coli. 

a) Construction Of Plasmids Encoding C Fragment Fusion 
Proteins 

In Rxample 1 1. ii was demonstrated thai the JiUkilc toxin A repeal domain can be 
elllcienlly expressed and purified in £. coli as either native (expressed in ihc pLT 23a vector 
in clone pPA I 870-2680) or fusion (expressed in the pMALc vector as a fusion with the 
coli MW in clone pMA 1870.2680) proteins. l-usion proteins comprising a fusion between 
ihe MBP. portions of the C. ilitficile toxin A repeat domain (shown to be expressed as a 
soluble fusion protein) and the C fragment of the C. hotulimnn type A toxin were consirucied. 
A fusion protein comprising the C fragment of the (\ hoiulinum type A toxin and the MBP 
was also constructed, 

l-iyurc 25 provides a schematic representation of the botulinal fusion proteins along 
with the donor constructs containing the ( \ lii/fkilc toxin A sequences or ( boiulmum V 
fragment sequences which were used to generate the botulinal fusion proteins. In Figure 25. 
the solid boxes represent C. Jifjficile toxin A gene sequences, the open bo.xes represent i \ 
hotulinwn C fragment sequences and the solid black ovals represent the culi MBP. When 
the name for a restriction enzyme appears inside parenthesis, this indicates that the restriction 
site was destroyed during construction. An asterisk appearing with the name for a restriction 
enzyme indicates that this restriction site was recreated at the cloning junction. 

In Figure 25. a restriction map of the pMA1870-2680 and pPAl 100-2680 constructs 
(described in f-xample 11) which contain sequences derived from the ( difficile toxin A 
repeat domain are shown: these constructs were used as the source of difficile toxin A gene 
.sequences for the construction of plasmids encoding fusions between the < *. hotidinum C 
fragment gene and the i \ difficile toxin A gene. The pMA 1 870-2680 expression construct 
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cNpresses high levels of soluble, intaci fusion protein (20 mg/liier culture) which can be 
affinity purified on an aitiylose column (purification described in Example 1 Id). 

The pAllerBot construct (Figure 25) was used as the source off.', hoiulinum C 
fragment gene .sequences for the botullnal fusion proteins. pAlterBol was obtained from J. 
Middlebrook and R. Lemley at the U..S. Department of Defense. pAlterBot contains a 
synthetic C. hnitilinum C fragment inserted in to the pALTER-llK; vector (Promega). This 
synthetic V fragment gene encodes the same amino acids as does the naturally occurring C 
fragment gene. The naturally occurring C fragment sequences, like most closiridial genes, are 
cxtremel) .A'T rich (■niompst)n ci ciL. supra). This high A/T content creates expression 
difficulties in /f. coli and yeast due to altered codon usage frequency and fortuitous 
ITolyadcnylation sites, respectively. In order to improve the expression of C fragment proteins 
in /•;. coli. u sx nthelic version of the gene was created in which the non-preferred codons were 
replaced with preferred codons. 

The nucleotide sequence of the ( ' hniulimim C fiagmem gene sequences eontaincti 
within pAlterUot is listed in SLQ ID NO:22. The first six nucleotides (ATGGCT) encode a 
methionine and alanine residue, respeciiveiy. These iwo amino acids result from the insertion 
of the ( ■ hoitilinum C fragment sequences into the pALTHR® vector and provide the initiator 
methionine residue. The amino acid sequence of the C. hoiiilimini C fragment encoded by the 
.sequences contained within pAltcrBot is li.sted in SCQ ID N0:2.1. I he first two amino acids 
(Met Ala) are encoded by vector-derived sequences, l-rom the third amino acid residue 
t>iuvaid (.Argl. the amino acid sequence is identical to that found in the C. luiiiilimim type A 
loxin gene. 

The pMA 1870.2680. pPAl 100-2680 and pAlterBot constructs were used as progenitor 
plasmids to make expression constructs in which fragments of the C. Jijf/icUe to.xin A repeat 
domain were expressed as genetic fusions with the ('. hoiulinum C fragment gene u.sing the 
pMAl.-c expression vector (New Hngland BioLabs). The pMAL-c expression vector 
generates fusion proteins which contain the MBP at the amino-terminal end of the protein. A 
construct. pMIJol. in which the C. hoiulinum C fragment gene was expressed as a fusion with 
only the MBP was constnjcted (Figure 25). Fusion protein expression was induced from E. 
toll strains harboring the above plasmids. and induced protein was affinity purified on an 
aniylose resin column. 
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i) Construction Of pBlucBot 

In order to facilitate the cloning of the C hoiulinum C fragment gene sequences into a 
number of desired constructs, the bolulinal gene sequences were removed from pAltcrBot and 
were inserted into the pBluescript plasmid (Slratagene) lo generate pBlueBoi (Tigure 25). 
pBlucBot was conslruclcd as follows. Bacteria containing the pAllcrBoi plasmid were grown 
in medium containing tetracycline and plasmid DNA was isolated using the QIAprep-spin 
Plasmid Kit (Qiagcn). One microgram of pAllerBoi DNA was digested with \'co\ and the 
resulting 3' recessed sticky end was made blunt using the Klenow iVagment of ONA 
polymerase 1 (here after the Klenow fragment). The pAlterBot DNA was then digested with 
///m/lll to release the bolulinal gene sequences (the Bot insert) as a blunt (filled SaA site)- 
/7//7C/III fragment. pBluescript vector DNA was prepared by digesting 200 ng of pBlucscripi 
DNA with Snm\ and HincJlU, The digestion products from both plasmids were resolved on an 
agarose gel. The appropriate iVagments were removed IVom the gel. mixed and purified 
uiili/.ing the Prep-a-Ciene kit (UioRad). The eiutcd DNA was then ligaled using 14 DNA 
ligase and used to transform competent DH5a cells (Cjibco-BRL). Most cells were made 
competent for transformation using the calcium chloride protocol ol* Sambrook ci aL. supra at 
I.82-1.S7>. Recombinant clones were i.solaled and confirmed by restriction digestion using 
standard recombinant molecular biology techniques (Sambrook ci uL supra). Ihe resultant 
clone. pBlueBot. contains several useful unique restriction sites Hanking the Bot insert {i.e.. 
the ( hoiulinum C fragment sequences derived from pAllerBol) as shown in I'igure 25. 

ii) C:onstruction Of C\ difficile / C baUilinum I 
MBP Fusion Proteins 

C\)nstrueis encoding fusions between the L\ ifijficilc toxin A gene and the C. hoiulinum 
C fragment gene and the MBP were made utilizing the same recombinant DNA methodology 
outlined above: these fusion proteins contained varying amounts of the ( lii/ficilc toxin A 
repeat domain. 

The pMABol clone contains a 2.4 kb insert derived from the (' (tif/icilc toxin A gene 
fused to the Bot insert {i.e. the C hoiulinum C IVagment sequences derived from pAlierBot). 
pMABot (I'igure 25) was constructed by mixing gel-purified DNA from SoiMHimHW digested 
pBlueBot (the 1.2 kb Bot fragment). Spe\INof\ digested pPAl 100-26X0 (the 2.4 kb dUficile 
toxin A repeat fragment) and XhaUlIindlll digested pMAl.-c vector. Kecombinani clones 
were isolaled. confirmed by restriction digestion and purified using the QIAprcp-spin Plasmid 
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Kit (Qiagen). This clone expresses the toxin A repeats and the botulinal C fragment protein 
sequences as an in-frame fusion with the MBP. 

The pMCABot construct contains a 1.0 kb insert derived from the C. difficile toxin A 
gene fused to the Bot insert (i.e. the C. Imtulimim C fragment sequences derived from 
pAllcrBot). pMCABot was constructed by digesting the pMABot clone with EcoR\ to 
remove the 5" end of the C. ilifficilc toxin A repeat (see Figure 25. the pMAl.-c vector 
contains a Ea)K\ site to the C. Jiffidle insert in the pMABot clone). The restriction sites 
were niied and rcllgated together alter gel purification. The resultant clone (pMCABol. 
l-igure 25) generated an in-frame fusion between the MBP and the remaining y portion of the 
( ■ dUficilu toxin A repeat domain lu.sed to the Bot gene. 

i he pMNABol clone contains the I kb SpeMEcoKl (lilled) fragment from the C. 
dUficile toxin A repeat domain (derived from clone pPAl 100-2680) and the i.2 kb C. 
homtimim C fragment gene as a \co\ (filled)////m/iil fragment (derived from pAltcrBot). 
I hcse two Iragmcnts were inserted into the pMAL-c vector digested with XlntMHimm. The 
two in.seri fragmcni.s were generated by digestion of the appropriate plasmid witli EloKX 
(pPA! IO()-26«0) or Scn\ (pAlierBol) (bllowcd by treatment with the Klenou fragment. After 
ia>almcnt with the Klenow fragment, the pla.smids were digested with the second enzyme 
(either Spd or HincaU). All three fragments were gel puritied. mixed and Prep-a-Gene 
purified prior to ligation, l-ollowing ligation and transformation, putative recombinant.s were 
analyzed b\' restriction analysis: the EcoRl site was found to be regenerated at the fusion 
Junction, as was predicted for a fusion between the filled EcoRl and Nco] sites. 

A construct encoding a fusion protein between the botulinal C fragment gene and the 
MBP gene \^as constructed (/.c. this fusion lacks any C. difficile toxin A gene .sequences) and 
termed pMUot. I he pMBot construct was made by removal of the C. difficile toxin A 
sequences from the pMABot construct and fusing the C fragment gene .sequences to the MBP. 
niis was accomplished by digestion of pMABot DNA with .V/t/I (located in the pMALc 
polyiinkcr to the Xlial site) and A7»oI (located 3" to the Nod .site at the toxA-Bot fusion 
junction), lilling in the Xhu\ site using the Klenow fragment, gel purifying the desired 
1-e.striction fragment, and ligating the blunt ends to circularize the plasmid. I-ollowing ligation 
and transformation. puiati\e recombinants were analyzed by restriction mapping «if the Bot 
iascrt (i.e. the ('. lunuliiuini C fragment .sequences). 
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b) Expression Of C boiulinum C Fragment Fusion Proteins In 
£• coli 

Large scale (1 liter) cultures of the pMAL-c vector, and each recombinant construct 
described above in (a) were grown, induced, and soluble protein fractions were isolated as 
described in Example 18. 'F he soluble protein extracts were chromatograplied on amylosc 
affinity columns to isolate recombinant fusion protein. The purified recombinant fusion 
proteins were analyzed by running samples on SDS-PAGE gels followed by Coomassic 
staining and by Western blot analysis as described (Williams ci uL (1994) supraY In brief, 
extracts were prepared and chromalographcd in column buffer (10 mM NaPOa. 0.5 M NaCl. 
10 mM p-mercaplocthanoL pH 7.2) over an amylose resin (New England Biolabs) column, 
and clutcd with column buffer containing 10 mM maltose as described [Williams, el ui 
( 1994). supra]. An SDS-PA(jF. gel containing the purified protein samples stained with 
Coomassic blue is shown in Figure 26. 

In l igLire 26. the ibilowing samples were loaded, l.anes 1-6 contain protein purified 
from E. coll containing the pMAL-c. pPA 1 870-2680. pMABot. pMNABoi. pMCABot and 
pMBoi piasmids. respectively, l.ane 7 contains broad range molecular weight protein markers 
(BioRad). 

The protein samples were prepared for electrophoresis by mixing 5 ^il of eluted protein 
w ith 3 |.il o!'2X SDS-PAGE sample buffer (0.123 mM i ri.s-HC'l. pi I 6.8. 2 mM ED TA. 6% 
SDS. 20% glycerol. 0.023% hromophenol blue: (i-mercapioelhanol is added to 5% before 
use). F he samples were healed to 93%' for 5 min. then cooled and loaded on a 7.5% agarose 
SlJS-PACil'. gel. liroad range molecular weight protein markers were also loaded to allow 
estimation of the MW of identified fusion proteins. After electrophoresis, protein was 
detected generally by staining the gel with Coomassic blue. 

In all cases the yields were in excess of 20 mg fusion protein per liter culture (.see 
Table 36) and, with the exception of the pMCABol protein, a high percentage o' e.. greater 
than 20-50% of total eluted protein) of the eluted fusion protein was of a MW predicted for 
the full length fusion protein (I-igure 26). It was estimated (by visual inspection) that less 
than 10% of the pMCABot fusion protein was expressed as the full length fusion protein. 
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TABLE 36 



(""nii^t riirt 
V- v/iiaii u^i 


Yield (mg/liicr of Culiurc) 


Perceniage Of Total 
Soluble Protein 


pMABol 


24 


5,0 


pMCABol 


34 


5.0 


pMNABot 


40 


5.3 


pMBol 


22 


5.0 


pMA 1870-2680 


40 


4.8 



riicse results demonstrate that high level expression of intact (\ bntitlinum C 
IVaument.'( ; diffkilc lo.xin A fusion proteins in £. col\ is feasible using the pMAL-c 
expression s>'siem. These results are in contrast to those reported by H, LaI>cnoticrc. tv «/. 
I IW";). supra. In addition, these results show that it is not necessary to fuse the bolulinal C 
tragmeni gene to the (.'. UiffkUc toxin A gene in order to produce a soluble fusion protein 
using the pMAI.-c system in tW/. 

In order to determine whether the abovc-dcscribcd bolulinal fusion proteins were 
recognized b\ anti-C. boiui'mwn toxin A antibodies. Western blots were performed. Samples 
containing aflinily-purifled proteins from E. colt containing the pMABoi. pMCABoL 
pMNABoi. pMBol. pMA 1 870-2680 or pMALc plasmids were analyzed. SDS-PAGH gels 
(7..^% acr\ lamide) were loaded with protein samples purified from each expression construct. 
After electrophoresis, the gels were blotted and protein transJ'er was conllrmed by Ponceau S 
staining (us described in txample I2b|. 

I'ollowing protein transfer, the blois were blocked by incubation for I hr at 20°C in 
blocking buffer |PBS r (PBS containing 0.1% Tween 20 and 5% dry milk)l. The blots were 
then incubated In 10 mi of a solution containing the primary antibody: this solution comprised 
a 1/500 dilution of an anli-C. bondmum toxin A IgY PEG prep (described in Example 3) in 
blocking buffer. The blots were incubated for I hr at room temperature in the presence of the 
primarx antibody. The blots were washed and developed using a rabbit anli-chicken alkaline 
phosphata.se conjugate (Boehringer Mannheim) as the secondary antibody as follows. The 
rabbit anti-chicken antibody was diluted to I ^g/ml in blocking buffer (10 ml fmal volume 
per blot) and the blots were incubated at room temperature for I hour in the presence of the 
secondary antibody. The blots were then wa.shed successively with PBS T. BBS-Twecn and 
50 niM Na.CO,. pH 9.5. The blots were then developed in freshly-prepared alkaline 
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phosphatase subsiraie buffer (100 Mg/nil niiro blue icirazolium, 50 Mg/ml 5-bromo-chloro- 
iiuiolylphosphaie, 5 mM MgCI, in 50 mM NaXO,. pH 9.5). Developmeni was slopped by 
flooding the blots wiih distilled water and the blots were air dried. 

This Western blot analysis detected anii-C. bondinum toxin reactive proteins in the 
pMABot. pMCABot, pMNABot and pMBot protein samples (corresponding to the predicted 
lull length proteins ideniilied above by Coomassic staining in Figure 26), but not in the 
pMA II 00-26X0 or pMALc protein samples. 

These results demonstrate that the relevant fusion proteins purilied on an amylose rcsin 
as described above in section a) contained inimunoreactive Choutlinwn C fragment protein as 
predicted. 

rXAMPLE 23 

Ciencralion Of Neutralizing Antibodies 
By Nasal Administration Of pMBot Protein 

The ability ufthe recombinant boiulii^al toxin proteins produced in bxample 22 to 
stimulate a systemic immune response against botulinai toxin epitopes was assessed. I his 
example invoked: a) the evaluation of the induction of serum IgCj liters produced by nasal or 
oral administratit)n of botulinai toxin-containing Ji/ficitc toxin A fusion proteins and b) 
the in vivo neutralization of C houilinum type A neurotoxin by anti- recombinant (* 
bomlinwv C* fragment antibodies. 

ii) Rvaluation Of The Induction Of Scrum IgC; Titers Troduccd 
By NamI Or Oral Administration Of Botulinai Toxin- 
Containing C difficile Toxin A Fusion Proteins 

Six groups containing five 6 week old CF female rats (Charles River) per group wore 
immunized nasally or orally with one of the following ihree combinations using protein 
prepared in lixample 22: (1) 250 Mg pMBol protein per ral (nasal and oral); 2) 250 
pMABoi protein per ral (nasal and oral); 3) 125 Mg pMBoi admixed with 125 ^g pMAI«70- 
2680 per ral (nasal and oral). A second set of 5 groups containing 3 CF female rats/group 
were immunized nasally or orally with one of the following combinations (4) 250 ng 
pMNABol protein per rat (na.sal and oral) or 5) 250 ^g pMAi.-c protein per rat (nasal and 
oral). 
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The fusion proteins were prepared for immunization as follows. The proteins (in 
column bulTer containing 10 mM maltose) were diluted in 0.1 M carbonate buffer, pH 9.5 and 
administered orally or nasally in a 200 mI volume. The rats were lightly sedated with ether 
prior to administration. The oral dosing was accomplished using a 20 gauge feeding needle. 
The nasal dosing was performed using a P-200 micro-pipettor (Gilson). The rats were 
boosted 14 days after the primary immunization using the techniques de.scribed above and 
were bled 7 days later. Rats from each group were lightly etherized and bled from .he tail. 
The blood was allowed to clot at 37°C for 1 hr and the serum was collected. 

The serum from individual rats was analyzed using an FLISA to determine the anii-C 
hon,n,uau type A lo.Kin IgG serum titer. The ELISA protocol used is a modificatio.i of that 
described in li.vample 13c. Briefly. 96-well microliter plates (1-alcon. Pro-Bind Assay I'lates) 
were coated with C. hoiu/hmm type A to.void (prepared as described in Example 3a) by 
placing 100 Hi volumes of C. hoiulhmm type A toxoid at 2.5 Mg/ml in PBS containing 
0.005% thimero.sal in each well and incubating overnight at 4°C. The next morning, the 
coating su-spensions were decanted and all wells were washed three times using PB.S. 

In order to block non-specillc binding sites. 100 mI of blocking .solution fO.5% BSA in 
PBS| was then added to each well and the plates were incubated for I hr at 37°C. The 
blocking .solution was decanted and duplicate samples of 150 mI of diluted rat .serum added to 
the nr.st well of a dilution series. The initial testing serum dilution was 1:30 in blocking 
•solution containing 0.5% Tween 20 followed by 5-fold dilutions into this solution. Thi.s was 
accomplished h.v serially transferring 30 ^.1 aliquots to 120 mI blocking solution containing 
0.5% Tween 20. mi>cing. and repeating the dilution into a fresh well. After the final dilution. 
30 Ml was removed from the well such that all wells contained 120 h! final volume. A total 
of 3 such dilutions were performed (4 wells total). The plates were incubated 1 hr at 37'C. 
following this incubation, the .serially diluted samples were decanted and the wells were 
washed six limes using PB.S containing 0.5% Tween 20 (PBST). To each well. 100 mI of a 
rabbit anti-Rat IgG alkaline phosphatase (Sigma) diluted (l/IOOO) in blocking buffer 
containing 0.5% Tween 20 was added and the plate was incubated for I hr at 37°C. The 
conjugate .solutions were decanted and the plates were washed as described above, substituting 
50 mM Na,CO;. pH 9.5 for the PBST in the final wash. The plates were developed by the 
addition of 100 pi of a solution containing I mg/ml para-nitro phenyl phospliate (Sigma) 
dissolved in 50 mM Na.CO,. 10 mM MgCI,. pH 9.5 to each well, and incubating the plates at 
r.)om temperature in the dark for 5-45 min. The absorbcncy of each well was measured at 
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410 nm using a Dynaiech MR 700 plate reader. The results are summarized in Tables 37 and 
38 and represent mean serum reactivities of individual mice. 

rVBIT. 37 



iXrtcmiinmion Ol' Anti-C honilmum Type A hwin Scnim Igti Titers I'cilhminj! Inmiiini/niion With ('. houtUitmn C TrajimcnU Duiainin;! 

t-usuin I'rnicins 



Ktniic oi In 


iiTiuni/,alinn 


Nasat 


Oral 


liniminiijivn 


IMMONF 


pMlkn 


pMHot & 
pMAIK7()- 
2680 


pMAllot 


pMlliH 


pMAI«70. 


pMAbm 


Ditiiiion 








1 




i.tuo 


1 xm 


0 m) 


0 \*H\ 


0 0X0 


(1.120 


1 1 5(1 


0017 




O.540 


IH)22 


0 (170 


0.020 


0 02? 


1 750 




0.2X0 


0.2&l» 


O.UlO 


0 020 


0 \)\{) 


(1 014 


l::»75() 


iUMM 


0.0K4 


i)Xm 


0 am 


0 (KM) 


0 010 


0 007 


" Kais 1 csicd 




5 


y 


5 




1 


1 



NiuiibL-rs rcprc'iciil ilic a%crasc v»tuc.\ (thiauicil Ironi luo lil.tSA plates, vtnndardi/cd uitli/iiit: ilur prcininumc control 



TABLE 38 



Deierminaiion OfAmi-C'. hotuhnum Type A foxin Senini luCi Tilers 
Following immunizniion Willi (\ hotulinum C Fnmincni-Containinu Fusion Proteins 



Route of Immunization 


Nasal 


Oral 


hnmunoiicn 


PRE-IMMUNE 


pMBol 


pMABoi 


pMNABoi 


pMNAHol 


Oiluiion 




1:30 


0.040 


0.557 


0.010 


0.015 


0.010 


I;I5() 


O.OOQ 


0.383 


0.001 


0.003 


0.002 


1:750 


0.001 


O.UO 


0.000 


0.000 


0.000 


l:37.SO 


0.000 


0.040 


0.000 


0.000 


0.000 


« Rais rested 




1 


1 


3 





The above 1:LISA results deinonsiraie that reactivity auainst the botiilinal fusion 
30 proteins was strongest when the route of administration was nasal: only weak responses were 
stimulated when the botulinal I'usion proteins were given orally. Nasally delivered pMbot and 
pMBot admixed with pMA 1 870-2680 invoked the greatest serum IgO response. Hiesc results 
show that only the pMBot protein is necessary to induce liiis response, since the addition ol 
the pMA1870-2680 protein did not enhance antibody response ( l able 37). Placement of 
35 C'. difjftcile toxin A fragment between the MBP and the hoiulimm} C fragment protein 
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dramatically reduced ami-bot IgG titer (see results using pMABot. pMCABot and pMNABot 

proteins). 

This study demonstrates that the pMBot protein induces a strong serum IgG response 
directed against C. hniulinum type A toxin when nasally administered. 

b) In Vivo Neutralteation Of C botulinum Type A Neurotoxin 
By AntI- Recombinant C. botulinum C Fragment Antibodies 

The ability of the anti-C holuUnum type A toxin antibodies generated by nasal 
administration of recombinant botulinal fusion proteins in mts (Example 22) to neutralize C. 
hon.hm,m type A toxin was tested in a mouse neutralization model. The mouse model is the 
an accepted method for detection of botulinal toxins in body lluids and for the evaluation of 
umi-botulinal antibodies [E.J. Schantz and D.A. Kauiicr. .1. As.soc. Off. Anal. Chem. 61:96 
(I WO) and Investigational New Drug (BB-IND-3703) application by the Surgeon General of 
the Department of the Army to the Federal l-ood and Drug Admini.siration|. The anti-C'. 
hiiiulimim type A toxin antibodies were prepared as follows. 

Rats from the group given pMBoi protein by nasal administration were boosted a 
second time with 250 ^g pMBot protein per rat and serum was collected 7 days later. Serum 
from t>ne rat from this group and from a preimmune rat uas tested for anli-C. hnlulmum type 
A toxin neutralizing activity in the mou.se neutralization model described below. 

The I.D.. of a solution of purified C hoiulimim type A toxin complex, obtained from 
Dr. Hric .lohn.son ( University of Wisconsin Madi.son). wa.s determined u.sing the intraperitoneal 
(IIM method of Schantz and Kauiter (J. A.ssoc. Off. Anal. Chem. 01:96 (1978)1 using 18-22 
gram female ICR mice and was found to be 3500 I.D„/ml. The determination of the LD,., was 
performed as Ibllows. A Type A toxin standard was prepared by dissolving purified type A 
toxin complex in 25 mM sodium phosphate buffer. pH 6.8 to yield a stock toxin solution of 
3.15 x 10' LD,„/mg. The 0D,„ of the solution was determined and the concentration was 
adjusted to 10-20 ng/ml. The toxin solution was then diluted 1:100 in gel-phosphate (30 mM 
phosphate, pll 6.4: 0.2% gelatin). Further dilutions of the toxin solution were made as shown 
bcl„vv in I able 39. Txvo mice were injected IP with 0.5 ml of each dilution .shown and the 
mice were observed for symptoms of botulism for a period of 72 hours. 
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TABLE 39 

Determination Of The LD.o Of Purified C howlinum Type A Toxin Complex 



Dilution 


Number Dead At 72 hr 


1:320 


2/2 


t:640 


2'2 


1:1280 


2/2 


1:2560 


0/2 (sick after 72 hr) 


1:5120 


0'2 (no symptoms) 



1^ From the results shown in Tabic 39. ihc loxin tiler was assumed to be between 2560 

LDJml and 5120 LDv/ml (or aboul 3840 LD,,/nil). This value was rounded to 3500 
LDm/hiI Ibr the sake of calculation. 

The amount oi' neutralizing antibodies present in the scrum of rats immunized nasally 
with pMBoi protein was then determined. Serum from two rats boosted with pMBoi protein 

15 as described abt>ve and preimmune serum from one rat was tested as follows. I'he loxin 
standard was diluted 1:100 in gel-phosphate to a linal concentration ol' 350 LD^/n^l- One 
milliliter of the diluted toxin standard was mixed with 25 ^il of serum JVom each of the three 
rats and 0.2 ml of gel-phosphate. The mixtures were incubated at room temperature Ibr 30 
min with occasional mixing. Kach of two mice were injected wilh II* with 0.5 ml of the 

20 mixtures. The mice were observed for signs of bolulism for 72 hr. Mice receiving serum 
from rats immunized wilh pMBot protein neutralized this challenge dose. Mice receiving 
preimmune rat serum died in less than 24 hr. 

The amouni of neutralizing anti-toxin antibodies present in the serum ol' rats 
immunized with pMBot protein v\as ilien quantitaied. Serum antibody titrations were 

25 performed by mixing 0.1 ml of each of the antibody dilutions (see Table 40) with 0.1 ml of a 
1:10 dilution of stock toxin solution (3.5 x 10"* I.D5,/ml) wilh l.O ml of gel-phosphate and 
injecting 0.5 ml IP into 2 mice per dilution. 1 he mice were then observed lor signs oi* 
bolulism for 3 days (72 hr). The results are tabulated in Table 39. 

As shown in Table 40 pMBol serum neutralized ('. hotulinum type A loxin complex 

30 when used at aililulion of 1:320 or less. A mean neulrulizing value ol* 168 lU/ml was 

obtained for the pMBol serum (an lU is dellned as 10.000 mouse LD,,,)- fliis value translates 
to a circulating serum tiler of about 3.7 lU/mg of serum prolein. This neutralizing liter is 
comparable to the commercially available bottled concentrated (Connaught Laboratories. Ltd.) 
hor.sc anli-( *. hofulinum antiserum. A 10 ml vial of Connaught antiserum contains about 200 
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mg/ml ofproleinieach ml can neutralize 750 lU ofC hoiulimm type A loxin. After 
administration of one vial to a human, the circulating serum titer of the Connaught 
preparation would be approximately 25 lU/ml assuming an average serum volume of 3 liters). 
Thus, the circulating anti-C hniulirmm liter seen in rats nasally immunised with pMBol 
protein (168 lU/ml) is 6.7 time higher than the necessary circulation titer of anii-C. hotttlinum 
iunibody needed to be protective in humans. 



TABLE 4{) 

Quantitation Of Neutraitzing Antibodies In pMBot Sera 



Dilution 


pMBot* 


Rat 1 


Rat 2 


1:20 


2/2 


2/2 


1:40 


2/2 


2/2 


1:X(> 


2/2 




1:160 


2 '2 


2 '2 


1 :3.20 


2/2** 


2 '2** 


1 :640 


0/2 


0/2 


t:l280 


0/2 


0/2 


1 :25m 


0/2 


0/2 



Numbers rcprescni the number of mice surviviny at 72 hour;; which received scrum taken from 
nils immunized with the pMBoi protein. 

Tliese mice survived but were sick alter 72 hr. 



fhesc results demonstrate that antibodies capable of neutralizing C. hoiulintmi type A 
toxin are induced when recombinant C*. homlimim C fraijmeni tiision protein produced in L 
i'oli is used as an immunogen. 

EXAMPLE 24 

Production Of Soluble C honilinum C i 'ragment 
Protein Substantially F- ree Of hndotoxin Contamination 

r.xample 23 demonstrated thai neutralizing antibodies are generated by immunization 
with the pMBot protein expressed in E. colL These results showed that the pMBot fusion 
protein is a good vaccine candidate. However, immunogens suitable for use as vaccines 
should be pyrogen-free in addition to having the capability ol* inducing neutralizing 
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antibodies. Expression clones and conditions thai facilitate the production of C hoiulmum C 
fVagmeni protein for utililization as a vaccine were developed. 

The example involved: (a) determination of pyrojicn content of the pMBoi protein: 
(b) generation of C.'. bofulimwi C fragment protein free of the MBP; (c) expression of C. 
homlmum C fragment protein using various expression vectors: and (d) purification of soluble 
( hodtlimim C fragment protein substantially free of signilkanl endotoxin contamination, 

a) Determination Of The Pyrogen Content Of The pMBot 
Protein 

In order to use a recombinant antigen as a vaccine in humans or other animals, the 
antigen preparation must be shown to be free of pyrogens. The most significant pyrogen 
present in preparations of recombinant proteins produced in gram-negaiive bacteria, such as £. 
coli. is endotoxin (l-.C. Pearson. Pyrogens: endotoxins. LAL lesiin^i and chpyro^cniaion. 
(1983) Marcel Dekkcr. New York, pp. 23-56]. I'o evaluate the utility of the pMBot protein 
as a vaccine candidate, the endotoxin content in MBP fusion proteins was determined. 

I hc endotoxin content of recombinant protein samples was assayed utilizing the 
Limulus assay (LAL kit: Associates of Cape Cod ) according to the manufacturer's 
instructions. Samples of affmity-purilied pMal-c protein and pMA 1 870-2680 were found to 
contain high levels of endotoxin (>50.000 lEU/mg protein: I'-LI (endotoxin unit)). This 
suggested that MBP- or toxin A repeat-containing fusions with the botulinal C fragment 
should also contain high levels of endotoxin. Accordingly, removal of endotoxin from 
alllnity-purifled pMal-c and pMBoi protein preparations was attempted as Ibllows. 

Samples of pMal-c and pMBot protein were depyrogenated with polymyxin to 
determine if the endotoxin could be easily removed. The following amount of protein was 
treated: 29 ml at 4.8 OD,a/ml for pMal-c and 19 mis at 1.44 OD,«,/ml for pMBot. The 
protein samples were dialyzed extensively against PBS and mixed in a 50 ml tube (Falcon) 
with 0.5 ml PBS-equilibrated polymyxin B (AtTi-Prep Polymyxin. BioRad). The samples 
were allowed to mix by rotating the tubes overnight at 4^C. The polymyxin was pelleted by 
cenlritiigaiion for 30 min in a bench top centrifuge at maximum speed < approximately 2000 x 
*l) and the supernatant was removed. The recovered protein (in the supernatanl) was 
quantified by OD,,,,,. and the endotoxin activity was assayed by LAL. In both cases only 
approximately 1/3 of the input protein was recovered and the polymyxin-ireated protein 
retained significant endotoxin contamination (approximately 7000 MU/mg of pMBol). 



- 154 - 



wo 98/08540 PCT/US97/15394 

The depyrogenaiion experiment was repealed using an independently purified pMal-c 
protein preparation and similar results were obtained. From these studies it was concluded 
that signitlcani levels of endotoxin copurifies with these MBP fusion proteins using the 
amylose resin. Furthermore, this endotoxin cannot be easily removed by polymyxin 
treatment. 

These results suggest that the presence of the MBP sequences on the fusion protein 
complicated the removal of endotoxin from preparations of the pMBot protein. 

b) Generation Of C. holulinum C Fragment Protein Free Of 
The MBP 

It was demonstrated that the pMBot fusion protein could not be easily purified from 
contaminating endotoxin in section a) above. The ability to produce a pyrogcn-free (cfi.. 
cndoioxin-frcc) preparation ol' soluble botulinal C fragment protein free of the MBP tag was 
next investigated. The pMBot expression construct was designed to facilitate purification of 
the botulinal (; fragment from the MBP tag by cleavage ol" the fusion protein by utilizing an 
engineered Factor Xa cleavage site present between the MBP and the botulinal C fragment, 
fhc l-actor .Xa cleavage was performed as follows. 

factor .Xa (New fingland Biolabs) was added to the pMBot protein (using a 0.1-1.0% 
Factor Xa/pMBot protein ratio) in a variety of buffer conditions (c'..t;.. PBS-NaCl (PBS 
containing 0..^ M NaCl). PBS-NaCI containing 0.2% Tween 20. PBS. PBS containing 0.2% 
Twecn 20. PBS-C (PBS containing 2 mM CaCK). PBS-C containing either 0.1 or 0.5 % 
Tween 20. PBS-C" containing either 0,1 or 0.5% NP-40. PBS-C containing cither 0.1 or 0.5% 
friion X-100. PBS-C containing 0.1% .sodium deoxycholaic. PBS-C containing 0.1% SDS). 
file Factor Xa digestions were incubated for 12-72 hrs at room temperature. 

fhe extent of cleavage was a.ssessed by Western blot or Coomassie blue staining of 
proteins following electrophoresis on denaturing SDS-PAGE gels, as described in Example 
22. Cleavage reactions (and control samples of uncleaved pMBot protein) were centrifuged 
for 2 min in a microfuge to remove in.solublc protein prior to loading the samples on the gel. 
The 1 actor Xa treated samples were compared with uncleaved. unccntrifuged pMBot samples 
on the same gel. The results of this analysis is .summarized below. 

1 ) Most (about 90%) pMBol protein could be removed by cenirifugation. even 
when uncleaved control samples NNcre utilized. This indicated that the pMBot fusion protein 
was not fully soluble {i.e., it exists as a suspension rather than as a solution). fThis result was 
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consistent with the observation that most affinity-purified pMBot protein precipitates after 
long term storage (>2 weeks) at 4°C. Additionally, ihe majority (i.e., 75%) of induced 
pMBot protein remains in the pellet after sonication and clarification of the induced E, coli. 
Rcsuspcnsion of these insoluble pellets in PBS followed by sonication results in partial 
solubilization of the insoluble pMBot protein in the pellets.) 

2) The portion of pMBot protein thai is fully in solution (aboui 10% of pMBot 
protein) is completely cleaved by Factor Xa. but the cleaved (released) botulinal C fragment is 
relatively insoluble such that only the cleaved MBP remains fully in solution. 

3) None of the above reaction conditions enhanced solubility without also 
reducing effective cleavage. Conditions that effectively solubilized the cleaved botulinal C 
fragment were not identified. 

4) The use of 0.1% SDS in the buffer used for 1-actor Xa cleavage enhanced the 
solubility oi' the pMBot protein (all of pMBoi protein was soluble). However, the presence of 
the SnS prevented any cleavage of the fusion protein with l-aetor Xa. 

5) Analysis of pelleted protein from the cleavage reactions indicated that both full 
length pMBot {i,c,. uncleavcd) and cleaved botulinal C fragment protein precipitated during 
incubation. 

These results demonstrate that purification of soluble botulinal C fragment protein after 
cleavage ol' the pMBol fusion protein is complicated by the insolubility of both the pMBot 
protein and the cleaved botulinal C fragment protein. 

c) Expression Or C botulinum C Fragment Using Various 
Expression Vectors 

In order to determine if the solubility of the botulinal C fragment was enhanced by 
expressing the C fragment protein as a native protein, an N-ierminal His-tagged protein or as 
a fusion with gluialhione-S-transfcrase (GST), alternative expression plasmids were 
constructed. These expression constructs were generated utilizing the methodologies described 
in i.*\ample 22. Figure 27 provides a schematic representation of the vectors described below. 

In I'igure 27. the following abbreviations are used. pP refers to the pl'T23 vector. 
pHIS refers to the pETHisa vector. pBlue refers to the pBluescript vector. pM refers to the 
pMAL-c vector and pG refers to the pGEX3T vector (described in Example 1 1). The solid 
black lines represent C. hottilinum C fragment gene sequences: the solid black ovals represent 
the MBP; the hatched ovals represent GST: "HHHHH" represents the poly-histidine lag. In 
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Figure 27. when the name for a resiriciion enzyme appears inside parenthesis, this indicates 
that the restriction site was destroyed during construction. An asterisk appearing with the 
name for a restriction enzyme indicates that this restriction site was recreated at a cloning 
Junction. 

i) Construction Of pPBot 

In order to express the C. hotulinum C IVagment as a native (/.c. non-fused) protein, 
the pl'l^ot plasmid (shown schematically in Figure 27) was constructed as follows. The C 
fragment sequences present in pAlterBot (E.xamplc 22) were removed by digestion of 
pAllerBoi with Mo\ and HindWl, The NcoMII'mJlW C fragment insert was ligaicd to 
pETIIisa \ecior (described in E.xampic 18b) vviiich was digested with /Va;l and HincAU, This 
ligation creates an expression construct in which the AVol-cncodcd methionine of the botulinal 
C" fragmeni is ihc initiator codon and directs expression of tlie naiix e botulinal C fraumeni. 
The ligation products were used to transform competent BL2l(Dl;;3)pl.ysS cells (Novauen). 
Recombinant clones were identified by restriction mapping. 

ii) Construction Of pHisBot 

In order to express the hotulinum C fragmeni containing a poly-hisiidine tag at the 
amino-tcrminus of the recombinant protein, the pHisBot plasmid (shown schematically in 
Figure 27) was constructed as follows. The jXcol/Himflll botulinal C fragment insert from 
pAlicrbot was ligated into the pI:Tllisa vector which was digested with A7/cl and HindlU. 
The \col (on the C* fragment insert) and Nhc\ (on the pFTHisa vector) sites were lllled in 
using the Klenovv fragment prior to ligation: these sites were then blunt end ligated (the Xt/cl 
bite was regenerated at the clone junction as predicted). The ligation products were used to 
transform comjietent BL21(DE3)pl.ysS cells and recombinant clones were identified by 
restriction mapping. 

fhe resulting pUisBot clone expresses the botulinal C fragment protein with a 
hislidine-tagged N-terminal extension having the following sequence: MetGlyllisllis 
^^^sHisllisllisHisHisHisl^isScrSer(ll.vHislleGluGlvAruHis MetAla. (SEQ M) NO:24): the amino 
acids encoded by the botulinal C fragment gene arc underlined and the vector encoded amino 
acids are presented in plain type. The nucleotide sequence present in the pHTHisa vector 
which encodes the pHisBot fusion protein is listed in SEQ ID NO:25. The amino acid 
sequence of the pMisRot protein is listed in SEQ H) NO:26. 
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iii) Construction Of pGBof 

The botulinal C fragment protein was expressed as a fusion with the glutathione-S- 
iransferase protein by constructing the pGBot plasmid (shown schematically in Figure 27). 
Tliis expression construct was created by cloning the A^,/I/.SV,/I C Jragmenl insert present in 
pBlueBot (Example 22) into the pGI£X3T vector which was digested with Smal and Xhol. 
The AW/I site (present on the botulinal fragment) was made blunt prior to ligation using the 
Klenow fragment. The ligation products were used to transform competent I3L2I cells. 

Each of the above expression constructs were tested by restriction digestion to confirm 
the integrity of the constructs. 

Large scale (I liter) cultures of pPBot [BL2l(DE.-?)pLysS host|. pilisBot 
|BL21(DE3)pL.ysS host] and pGBot (BL2I host) were grown in 2X YT medium and induced 
(u.sing IPTCI to 0.8-1.0 mM) for 3 hrs as described in C.xample 22. Total, soluble and 
in.soluble protein preparations were prepared from 1 ml aliquots of each large scale culture 
I Williams c, al. (1^94). .v,//,m| and analyzed by SDS-l>ACiK. .No obvi(u.s induced band was 
detectable in the pPBot or pHisBot samples by Coomassie staining, while i. prominent 
insoluble band of the anticipated MW was delected in the pCiBot sample. Soluble ly.sates of 
the pC.Bol large scale (resuspcnded in PBS) or plIisBoi large scale |resuspended in Novagen 
IX binding buffer (5 mM imidazole. 0.5 M NaCl. 20 mM IVis-IICI. pli 7.9)| cultures were 
prepared and used to alliniiy purify .soluble affinity-tagged protein as follows. 

■fhe pC}Bot lysate was affinity purified on a glutalhione-agaro.se resin (Pharmacia) 
exactly as described in Smith and Corcoran [Current Protocols in Molecular Biology. 
Supplement 28 (1994). pp. 16.7.1-I6.7.7|. The pHisBot protein was purified on the His-Bind 
resin (No\agen> utilizing the His-bind buffer kit (Novagen) exactly as described by 
manufacturer. 

Samples from the purification of both the pGBot and pHLsBot proteins (including 
uninduced. induced, total, soluble, and affinity-purified elutcd protein) were resolved on SDS- 
PAUE gels, f ollowing electrophoresis, proteins were analyzed by Coomassie staining or by 
Western blot detection utilizing a chicken anli-C. hi,u,limm lype A toxoid amibody (as 
described in Example 22). 

These studies showed that the pCiBot protein was almost emirely insoluble under the 
utilized conditions, while the pHisBot protein was soluble. Affinity purification of the 
pl lisBot protein on this first attempt was inefficient, both in tenns of yield (most of the 
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immunoreactive botulinal protein did not bind to the His-bind resin) and purity (the botuiinai 
protein was estimated to comprise approximately 20% of the total eluted protein). 

d) Purification Of Soluble C. botulinum C Fragment Protein 
Substantially Free Of Endotoxin Contamination 

The above studies showed that the pHisBot protein was expressed in E. coli as a 
soluble protein. However, the aftinity purification of this protein on the His-bind resin was 
\ery inefficient. In order to improve the affinity purification of the soluble pHisBot protein 
(in terms of both yield and purity), an alternative poly-histidine binding affinity resin (Ni- 
NTA ivsin: Qiagen) was utilized. The Ni-NTA resin was reported to have a superior binding 
alTinity (K.,- 1 x 10" at pH 8.0: Qiagen user manual) relative to the His-bi.id resin. 

A .soluble lysaie (in Novagen IX binding buffer) from an induced 1 liter 2X YT 
culture was prepared as described above. Briefly, the culture of pHisBot fi3121(Dn3)pLys.S 
hosti ua.s grown at .^T'C to an 0D«„. of 0,7 in I liter of 2X YT medium containing 100 
Mg/ml ampieillin. 34 ^g/'ml chloramphenicol and 0.2% gkico.se. Protein expression was 
induced by the addition of IPTG to 1 mM. Three hours after the addition of the IPTG. the 
cells were cooled for 15 min in a ice water bath and then centrifuged 10 min at 5000 rpm in 
a JAIO rotor (Bcckman) at 4»C. The pellets were rosuspended in a total volume of 40 mis 
Novagen l.\ binding buffer (5 niM imidazole. 0.5 M NaCI. 20 mM Tris-HCl. pll 7.9). 
transferred to two .>5 ml Oakridge tubes and frozen at -70«C for at least I hr. The tubes 
were thauecl and the cells were lysed by sonication (4 X 20 second bursts using a Branson 
.Sonilk-r 450 with a power setting of 6-7) on ice. The suspen.sion was clarified by 
centrifugation for 20 min at 0.000 rpm ( 10.000 x .u) in a JA-17 rotor (Beckman). 

The soluble lysate was brought to 0.1% NP40 and then was batch ab.sorhed to 7 ml of 
a 1:1 slurry ol" Ni-NTA resinrbiiiding buffer by stirring for I hr at 4°C. flic .slurry was 
poured into a column having an internal diameter of 1 or 2.5 cm (BioRad). The column was 
then washed sequentially with 15 mis of Novagen IX binding buffer containing 0.1% NP40. 
15 ml of Novagen IX binding buf fer. 15 ml wash bufTer (60 mM imidazole. 0.5 M NaCl. 20 
mM Tris-HC l. pH 7.9) and 15 ml NaHPO, wash buffer (50 mM NaHPO,. pH 7.0. 0.3 M 
NaCl. 10 % glycerol). The bound protein was eluted by protonation of the resin using elution 
buffer (50 mM NaHPO,. pH 4.0. 0.3 M NaCl. 10 % glycerol). The eluted protein was stored 
at 4''C\ 
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Samples of loial. soluble and eluled protein were resolved by SDS-PAGE. Protein 
samples were prepared for electrophoresis as described in Lxample 22b. Duplicate gels were 
stained with Coomassic blue to visualize the resolved proteins and C honduwm type A toxin- 
reactive protein was delected by Western blot analysis as described in lixampic 22b. A 
5 representative Coomassie stained gel is shown in Figure 28. In Figure 28. the following 
samples were loaded on the 12.5% acrylamidc gel. Lanes 1-4 contain respectively total 
proiein. soluble protein, soluble protein present in the llow-through of the Ni-N I'A column 
and affmily-purified pHisBot protein {ic. proicin released from ihe Ni-Nl A resin by 
proionation). Lane 5 contains high molecular weight protein markers (BioRad). 

'^^ I he purification of pHisBoi protein resulted in a yield of 7 mg of affinity purified 

protein from a I liter starting culture of BL2l(DE3)pLysS cells harboring the pHisBol 
plasmid. The yield of purified pHisBoi protein represented approximately 0.4% of the total 
soluble proicin in the induced culture. Analysis of the purified pHisBoi protein by SDS- 
PACih: revealed that at least 90-^)5% ol' the protein was present as a single band (I-iyure 28) of 

15 the predicted MW (50 kD). This 50 kD protein band was immunoreactive with anii-C. 

hafu/inimi type A toxin antibodies. I he extinction coefficient of the protein preparation was 
determined to be 1.4 (using the Pierce BCA assay) or 1.45 (using the Lowry a.ssay) ()D,^„ per 
1 mu/ml solution. 

Samples of pll neutralized eluted pHisBol protein were resolved on a KB 803 HPLC 
20 column i.Shodex). Although His-tagged proteins are retained by this sizing column (perhaps 
due to the inherent metal binding ability of the proteins), the relative mobility of the pHisBot 
protein was consistent with that expected for a non-aggregated proiein in .solution. Most ol* 
the induced pHisBoi protein was determined to be soluble under the growth and solubilization 
conditions utilized above (/.c, greater than 90% of the pHisBot protein was found to be 
25 soluble as judged by comparison of the levels of pHisBot proiein seen in total and soluble 
proiein samples prepared from BL2UDI£3)pLysS cells containing the pHisBot plasmid). 
SDS-PACiE analysis of samples obtained after centrifugalion. extended storage at -20**(\ and 
at least 2 cycles of freezing and thawing delected no protein loss (due to precipitation), 
indicating that the plIisBot protein is soluble in the eluiion buffer {i.e., 50 mM NallPO,,. pH 
30 4.0. 0.3 M NaCL 10 % glycerol). 

Determination of endotoxin contamination in the affinity purified pHisBot preparation 
(after pH neutralization) using the LAL assay (Associates of Cape Cod) delected no 
significant endotoxin contamination. 'I he assay was performed using the endpoint 
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chromogenic method fwiihout diazo-coupling) according lo the manufacturer's instructions. 
This method can detect concentrations of endotoxin greater than or equal to 0.03 EU/ml (KIJ 
refers to endotoxin units). The LAL assay was run using 0.5 ml of a solution comprising 0.5 
mg pHisBot protein in 50 mM NaHPO,. pl l 7.0. 0.3 M NaCI. 10 % glycerol; 30-60 FAJ Were 
detected in the 0.5 ml sample. Therefore, the affinity purified pHisBot preparation contains 
6(M20 EU/mg of protein. FDA Guidelines for the administration of parenteral drugs require 
that a composition to be administered to a human contain le.ss than 5 EU/kg body weight (The 
average human body weight is 70 kg; therefore up to 349 EIJ units can be delivered in a 
parental dose.). Because very small amount of protein arc administered in a vaccine 
preparation (generally in the range of 10-500 ^g of protein), administration of affinity 
puriticd pMisBot containing 60-120 EU/mg protein would result in delivery ol'only a small 
percentage of the permissible endotoxin load. For example, administration of 10-500 ^g of 
purified plIisBol to a 70 kg human, where the protein preparation contains 60 EU/mg protein, 
results in the introduction of only 0.6 to 30 EU [i.e.. 0.2 to 8,6% of the maximum allowable 
endotoxin burden per parenteral dose (less than 5 EU/kg body weight)]. 

The above results deiiionsiraie lhat endotoxin (EPS) does not copurify with the 
pHisBoi protein using the above purification scheme. Preparations of recombinanily produced 
pHisBot protein containing lower levels of endotoxin (less than or equal to 2 EU/ mg 
recombinant pn)iein) may be produced by washing the Ni-NTA column with wash buffer until 
the OD.j.^ returns to baseline levels {/.c. until no more UV-absorbing material eomes off of 
the column). 

I he above results illustrate a method for the production and purification of soluble, 
botulinal C fragment protein substantially free of endotoxin. 

EXAMPLE 25 

Optimization Of The Expression And Purification Of pHisBot Protein 

The results shown in Example 24d demonstrated that the pl IisBot protein is an 
excellent candidate for use as a vaccine as it could be produced as a soluble protein in coli 
and could be purified free of pyrogen activity. In order to optimize the expression and 
purification of the pHisBot protein, a variety of growth and purification conditions were 
tested. 
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a) Growth Parameters 
i) Host Strains 

The influence of the host strain utilized upon the production of soluble pliisBot 
protein was investigated. A larjie scale purification ol pHisBol was performed fas described 
in Example 24d above] using the BL2l(Dl-:3) host (Novaucn) rather than the 
BL2l(I)E3)pLysS host. The deletion of the pLysS plasmid in the BL2I(DE3) host yielded 
higher levels of expression due to de-repression of the plasmid s T7-lac promoter. However, 
the yield of affinity.purified soluble recombinant protein was very low (appro.ximately 600 
Mg/ liter culture) when purified under conditions identical to (hose described in Example 24d 
above. This result was due to the fact that expression in the BI.21(DE3) host yielded very 
high level expression of the plIisBoi protein as insoluble inclusion bodies as shown by .SDS- 
I'AGL analysis of protein prepared from induced BL21(DI-3) cultures (Figure 29. lanes 1-7. 
described below). These results demonstrate that the pllisBot protein is not inherently toxic 
to £. tali cells and can be expres.sed to high levels using the appropriate promoter/host 
combination. 

l igure 29 shows a Coomassie blue stained SDS-PACiH gel (12.5% acrylamide) onto 
which extracts prepared from BL21.(DE3) cells containing the pHisBot plasmid \\ere loaded. 
Each lane was loaded with 2.5 m1 protein sample mixed with 2.5 mI of 2X SDS sample buffer. 
The samples were handled as described in Example 22b. I he following samples were applied 
to the gel. Lanes 1-7 contain protein isolated from the BL2I(I)E3) host. Lanes 8-14 contain 
proteins i.solated from the BL21{DE3)pLysS host. lotal protein was loaded in lanes 1. 2. 4. 
6. «. 10 and 12. Soluble protein was loaded in Lanes 3. 5. 7. 9. I! and 13. Lane I contains 
protein from uninduced host cells. Lanes 2-L3 contain protein from host cells induced for 3 
hours. IPTG was added to a llnal concentration of 0.1 mM {Lanes 6-7). 0.3 mM (Lanes 4-5) 
or 1.0 mM (Lanes 2. 3, 8-13). The cultures were grown in LB broth (Lanes 8-0). 2X Y l 
broth (Lanes 10-1 1) or terrific broth (Lanes 1-7. 12-13). The plILsBot protein .seen in Lanes 
3. 5 and 7 is insoluble protein which spilled over from Lanes 2. 4 and 6. respectively. High 
molecular weight protein markers (BioRad) were loaded in Lane 14. 

A variety of expression conditions were tested to determine if the BI.21(DE3) host 
could be utilized to express .st)luble pHisBot protein at suitably high levels {i.e.. about 10 
mg/ml). The conditions altered were temperature (growth at 37 i>r 30«'C). culture medium 
(2X YT. LB or Terrinc broth) and inducer levels (0.1. 0.3 or 1.0 mM IPTG). All 
combinations of these variables were tested and the induction levels and solubilitv was then 
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assessed by SDS-PAGE analysis of total and soluble extracts [prepared from I ml samples as 
described in Williams el al.. (1994). supra]. 

All cultures were grown in 15 ml tubes (Falcon #2057). All culture medium was 
prew:armed overnight at the appropriate temperature and were supplemented with 100 ng/mi 
ampicillin and 0.2% glucose. Terrific broth contains 12 g/l bacto-tryptone. 24 g/l bacto-yeast 
extract and 100 ml/I of a solution comprising 0.17 M KH,PO,. 0.72 M K,III>0,. C ultures 
were grown in a incubator on a rotating wheel (to ensure aeration) to an 0I)„„ of 
approximately 0.4. and induced by the addition of IPTG. In all cases, high level expression 
of insoluble pHisBot protein was observed, regardless of temperature, medium or inducer 
concentration. 

I hc effect of varying the concentration of IPTG upon 2X YT cultures grown at 2.1»C 
then investigated. IPTG was added to a final concentration of either 1 mM. 0.1 mM. 
0.05 mM or 0.01 mM. At this temperature, similar level.s of pHis liot pro.cn was induced in 
Ihc presence of either I or 0.1 mM IPTG: these leveLs of expre.s.sion was lower than that 
observed at higher temperatures. Induced protein levels were reduced at 0.05 mM IPTG and 
absem at 0.01 mM IPTG (relative to 1.0 and 0.1 mM IPTG inductions at 23''C). However, 
no conditions were oKscrved in which the induced pHisBot protein was .soluble in this host. 
rhu.s. although expression levels are superior in the BL21(DF.3) host (as compared to the 
BL21(Df.-;)pLysS host), conditions that facilitate the production of soluble protein in this host 
could not be identified. 

These results demonstrate tiiat production of soluble pl IisBol protein was achieved 
using Ihc BL2MDFJ)p|.ysS host in conjunction with the T7-lac promoter. 

ii) Kffcct Of Varying Temperature, Medium And 
I PTC Concentration And Length Of Induction 

The etieet growing the host cells in various mediums upon the expression of 
recombinant botulinal protein from the pHisBot cxptes.sion construct [in the BL2I(I)I-3)pl.ys.S 
host! uas investigated. BL21(D0.3)pLysS cells containing the pHisBot plasmid were gro^v,V . 
in either I.B. 2X YT or Territk broth al 37T. The cells were Induced using 1 mM IPTG for 
a •> iir induction period. l£xpression of pHLsBot protein was found to be the highest when the 
cells were grown in 2X YT broth (see Figure 29. lanes 8-13). 

The cells were then grown at 30«C in 2X YT broth and the concentration of IPTG was 
varied from 1.0. 0.3 or 0.1 mM and the length of induction was either 3 or 5 hours. 
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Fxpression of pHisBol protein was similar at all 3 inducer concentrations utilized and the 
levels of induced protein were higher after a 5 hr induction as compared to a 3 hr induction. 

Using the conditions found to be optimal for the expression of pHisBoi protein, a large 
scale culture was grown in order to provide sufficient material for a large scale purification of 
5 the pHisBoi protein. Three 1 liter cultures were grown in 2X YT medium containing 100 
^ig/nil ampicillin. 34 |.ig/ml chloramphenicol and 0.2% glucose. The cultures were grown at 
30''C and were induced with 1.0 niM IPTG for a 5 hr period. The cultures were harvested 
and a .soluble lysaic were prepared as described in Example 18. A large scale purificaiion 
was performed as described in Example 24d with the exception that except the soluble lysaic 
10 was batch absorbed for 3 hours rather than for I hour. The ilnal yield was 13 mg pMisBot 
prolein/liler culture. The pHisBoi protein represented 0.75% of the total soluble protein. 

The above results demonstrate growth conditions under which soluble pllisBot protein 
is produced (/.c. use of the BL2l(DE3)pLysS host, 2X YT medium. 30"C. 1.0 niM IPTG for 
5 hours). 

15 

h) Optimization Of Purification Parameters 

l or Dpiimizaiion of purification condition.s. large scale cultures (3 X I liter) were 
grown at 3()°C and induced with 1 niM IPTG for 5 hours as described above. The cultures 
were pooled, distributed to centrifuge bottles, cooled and pelleted as described in Example 
20 24d. riie cell pellets were frozen at -70°C until used. Each cell pellet represented 1/3 of a 
liter starting culture and individual bottles were utilized for each opiimi/alion experiment 
described below. This standardized the input bacteria used for each experiment, such lliat the 
yields of affinity purified pHisBol protein could be compared between different optimization 
experiments. 

25 

i) Binding Specificity (pll Protonation) 

A lysaie of pHisBoi culture was prepared in PBS (pll «.0) and applied to a 3 ml Ni- 
NTA column equilibrated in PBS (pH 8.0) using a How rate of 0.2 ml/min (3-4 column 
volumes/hr) using an Econo chromatography system (BioRad). I he column was washed wiih 
30 PBS (pH 8.0) until the absorbanee (01)>«„) of the cluic was at baseline levels. The How rale 
was then increased to 2 ml/min and the column was equilibrated in PBS (pll 7.0). A pll 
gradient (pll 7.0 lo 4.0 in PBS) was applied in order to clute the bound pHisBoi protein from 
the column. Fractions were collected and aliquois were resolved on SDS-PAGE gels. The 
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PAGE gels were subjected lo Western blotting and the pHisBot protein was detected using a 
chicken anti-C hoiulimim l ype A toxoid antibody as described in Example 22. 

From the Western blot analysis it was determined that the pHisBot protein begins to 
eiute from the Ni-NTA column at pH 6.0. This is consistent with the predicted elution of a 
Mis-tagged protein monomer at pH 5.9. 

1 hesc results demonstrate that the pH at which the pHisBot protein is protonated 
(released* from Ni-NTA resin in PBS buffer is pH 6.0. 

ii) Binding Specificity (Imidazole Competition) 

In order to deline purification conditions under which the native E. cnii proteins could 
be removed from the Ni-NTA column while leaving the pHisBot protein bound lo the 
column, the following experiment was performed. A lysate of pHisBoi culture was prepared 
in 50 mM NallPO,. 0.5 M NaCI. 8 mM imidazole (pl i 7.0). This lysaic was applied to a .1 
ml Ni-NTA column equilibrated in 50 mM NaHPO,. 0.5 M NaCI (pH 7.0) using an Ecoiio 
chromatography .system (BioRad). A flow rate of 0.2 ml/min column volumes/hr) was 
utilized. The column was washed with 50 mM NaHPO^. 0.5 M NaCI (pH 7.0) until the 
absorbancc of the elutc returned to baseline, fhe How rate was then increased to 2 ml/min. 

I he column was eluted using an imidazole step gradient |in 50 niM NaHPO,. 0.5 M 
NaCI (pll 7.0(1. niution steps were 20 mM. 40 mM. 60 mM. 80 mM. 100 mM. 200 mM. 1.0 
M imidazole, followed by a wash using 0.1 mM EDTA (to strip the nickel from the column 
and remove any remaining protein). In each step, the wash was continued until the ()0,,„ 
returned to baseline, t ractions were resolved on SD.S-i'AGE gels. Western blotted, and 
pHisBot protein detected using a chicken anti-C. hottilimim Type A toxoid antibody as 
described in l-xample 22. Duplicate gels were stained with Coomassie blue to detect eluted 
protein in each fraction. 

The results of the PAGE analysis showed that most of the non-specitieally binding 
bacterial protein was removed by the 20 mM imidiazolc wash, with the remaining bacterial 
proteins being removed in the 40 and 60 mM imidazole washes, fhe pHisBot protein began 
to elute at 100 mM imidazole and was quantitatively eluted in 200 mM imidazole. 

These results precisely defined the window of imidazole wash stringency that 
opiimallv removes L. coli proteins from the column while spccirically retaining the pHisBot 
protein in this buffer. These results provided conditions under which the pHisBot protein can 
be purified free of contaminating host proteins. 
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iii) Purification Buffers And Optimized 
Purification Protocols 

A variety of purification parameters were tested durine the development of an 
optimized protocol for batch purification of soluble pHisBot protein. The results of these 
5 analyses are summarized below. 

Batch purifications were performed (as described in I-xample 24d) usiny several 
buffers to determine if alternative buffers could be utilized for binding of ihc pHisBot protein 
to the Ni-NTA column, it was determined that quantitative binding of pHisBot protein to the 
Ni-NTA resin was achieved in either Tris-HCI (pH 7.9) or NaHPO^ (pll 8.0) buffers. 

10 Binding of the pHisBot protein in NaHPOj buffer was not inhibited using 5 mM. 8 niM or 60 
mM imidazole. Quantitative clution of bound pllisBoi protein was oblained in buffers 
containing 50 mM NaUPO,. 0.3 M NaCl (pH 3.5-4.0), with or without 10% glycerol. 
However, ciuantitation of soluble afllnily purilled pHisBoi protein before and after a iVeeze 
thaw (following several weeks storage of the affinity purified elute at -2()°C) revealed that 

15 94% of the protein was recovered using the glycerol-coniaining buffer, hut only 6X% of the 
protein was recovered when the buf fer lacking glycerol was employed. This demonstrates 
that glycerol enhanced the solubility of the pHisBol protein in this low pM buffer when the 
eluied protein was .stored at freezing temperatures -20°C). Neutralization of pH by 
addition of NaH>POj buffer did not result in obvious protein precipitation. 

20 It was determined that quantitative binding of pllisBol protein using the batch format 

occurred after 3 hrs (Figure 30). but not after 1 hr of binding at 4°C' {the resin was stirred 
during binding). Figure 30 depicts a Coomaisse blue stained SDS-PACil- gel (7.5% 
acrylamide) containing samples of proteins isolated during the purification of pllisBot protein 
from lysittc prepared from the BL2l(f)R3)pl.ysS host. Haeh lane was loaded with 5 |.d of 

25 protein sample mixed with 5 }.il of 2X sample buffer and processed as described in Example 
22b. Lane 1 contains high molecular weight protein markers (BioRad). Lanes 2 and 3 
contain protein eluted from the Ni-NTA resin. Lane 4 contains soluble protein after a 3 hr 
batch incubation with the Ni-NTA resin. Lanes 5 and 6 contain soluble and total protein, 
respectively. Figure 30 demonstrates that the pHisBot protein is completely soluble [compare 

30 Lanes 5 and 6 which show that a similar amount of the 50 kD plUsBot protein is seen in 
both: if a substantial amount (greater than 20%) of the pHisBot protein were partially 
insoluble in the host cell, more pHisBot protein would be .seen in lane 6 (total protein) as 
compared to lane 5 (soluble protein)). Figure 30 also demonstrates thai the pHisBot protein is 
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completely removed from the lysate after batch absorption with the Ni-NTA rcsiii Ibr 3 hours 
(compare Lanes 4 and S). 

The reported high arHnity interaction of the Ni-NTA resin with His-tagged proteins 
(Kj= 1 x lO ' - at pH 8.0) suggested that it should be possible to manipulate the resin-protein 
complexes without signillcant release of the bound protein. Indeed, it was determined that 
after the recombinant protein was bound to the Ni-NTA resin, the resin-pHisUot protein 
complex was highly stable and remained bound following repeated rounds of centrifugaiion of 
the resin for 2 min at 1600 x When this cenirifugation step was performed in a 50 ml tube 
(Kalcon). a tight resin pellet formed. This allowed the removal of spent soluble lysaic by 
pouring off the supernatant followed by resuspension of the pellet in wash buffer. Further 
washes can be performed by centrifugaiion. The ability to perform additional washes permits 
the development of protocols for batch absorption of large volumes of lysate w ith removal of 
the lysate being performed simply by centrifugation following binding oi" the rccombinani 
protein to the resin. 

A simplilied. integrated puril'ication protocol was developed as follows. .A soluble 
lysate was made by rcsuspending the induced cell pellet in binding buffer (50 inM NaHPO,. 
U.5 M NaCI. 60 mM imidazole (pH 8.0) |. sonicating 4 x 20 sec and centrifuging for 20 min 
at 10.000 X }i. NP-40 was added to 0.1% and Ni-NTA resin (equilibrated in binding buffer) 
was added. Hight milliliters of a 1:1 slurry (resin:binding buffer) was used per liter of 
starting culture. I hc mixture was stirred for 3 hrs at 4°C. The slurry was poured into a 
column having a I cm internal diameter (BioRad). washed with binding buffer containing 
0.1% Nl'4(). then binding buffer until baseline was established (these steps may alternatively 
be performed by centrifugation of the resin, resuspension in binding bulfcr containing NP40 
followed by centrifugation and resuspension in binding buffer). Imidazole was removed by 
washing the resin with 50 mM NaHPO,. 0.3M NaCl (pH 7.0). Protein bound to the resin was 
eiuted using the .same buffer (50 mM NaHPOj, 0.3M NaCl) having a reduced pH (pll 3.5- 
4.0). 

A pilot purification was performed following this protocol and yielded 18 mg/liter 
ainnity-purilied pHisBot. The pHisBot protein was greater than 90% pure as estimated by 
C ooma.ssie .staining of an SDS-PA(}1: gel. This represents the highest ob.served yield of 
soluble aftinity-purified pHisBot protein and this protocol eliminates the need for separate 
imidazole-coniaining binding and wash buflers. In addition to providing a simplified and 
efficient protocol for the atTmity puritlcation of recombinant pHisBol protein, the above 
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results provide a variety of purification conditions under which pHisBot protein can be 
isolated. 



EXAMPLE 26 

The pHisBot Protein Is An Effective Immunogen 

In lixampic 23 it was demonstrated that nculraliziny antibodies arc generated in mouse 
serum alter nasal immunization with the pMBot protein. However, the pMBot protein was 
lound to copurify with significant amounts of endotoxin which could not be easily removed. 
The pHisBot protein, in conira.st. could be isolated free of siiinif leant endotoxin contamination 
making pllisBot a superior candidate for vaccine production. To further as.sess the .suitability 
of pHisBot as a vaccine, the immunogenicity of the pHisBot protein was determined and a 
comparLson of the relative immunogenicity of pMBot and pHisBot proteins in mice was 
performed as follows. 

fwo groups of eight BALBc mice were immunized with either pMBot protein or 
pHisBot protein u.sing C}erbu OMDP adjuvant (CC Biotech). pMBot protein (in PB.S 
containing 10 niM malto.se) or pHisBot protein (in .SO mMNaHPO,. ()..> M NaCl. iOVo 
glycerol, pl l 4.0) was mixed with Ocrbu adjuvant and used to immunize mice, l-aeh mouse 
received an IP injection of 100 ^ll antigen/adjuvant mix (50 Mg antigen plus 1 ^.ig adjuvant) on 
day 0. Mice were boosted as described above with the exception that the route of 
administration was IM on day 14 and 28. The mice were bled on day 77 and anti-C, 
hontliniim I ype A to.xoid titers were determined using serum collected from individual mice 
in each group (as described in li.xample 23). The results arc shown in Table 41. 
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TABLK 41 
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The results shown above in Table 41 demonstrate that both the pMBot and pHisBot 
proteins are immunogenic in mice as 100% of the mice (8/8) in each group serocon verted 
iVom non-immune to immune status. The results also shovv that the average titer oi' anli-C. 
hamNmim I ype A toxoid IgO is 20 fold higher after immuni7iition with the plIisBoi protein 
relative to immunization with the pMBot protein. This suggests that the pMisBot protein ma\ 
be a superior immunogen to the pMBot protein. 



EXAMPLE 27 

Immunization With The Recombinant 
pHisBot F^rotcin Generates Neutralizing Antibodies 



The results shown in Example 26 demonstrated that both the pl lisBot and pMBot 
proteins were capable of inducing high tilers of anti-C*. hmulinum type A toxoid-reaciive 
antibodies in immunized hosts. The ability of the immune sera from mice immunized with 
either the pHisBot or pMBot proteins to neutralize ( hotulimim type A toxoid in vivo was 
determined using the mouse neutralization assay described in Example 23b. 
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The iwo groups of eighi BALBc mice immunized wiih either pMBoi protein or 
pHisBoi protein in Example 26 were boosted again one week after the bleeding on day 77. 
The boost was performed by mixing pMBot protein (in PBS containing 10 mM maltose) or 
pHisBoi protein (in 50 mM NaHPO,, 0.3 M NaCl. 10% glycerol. pH 4.0) with Gerbu 
adjuvant as described in Example 26. Each mouse received an IP injection of 100 pi 
antigen/adjuvant mix (50 |.ig antigen plus I ^ig adjuvant). The mice were bled 6 days after 
this boost and the scrum from mice within a group was pooled. Serum from preimmune mice 
was also collected (this scrum is the same serum described in the footnote to Tabic 41 ). 

The presence of neutralizing antibodies in the pooled or preimmune scrum was 
detected by challenging mice with 5 LD^, units of type A toxin mixed with 100 of pooled 
serum. The challenge was performed by mixing (per mouse to he injecicd) 100 ^1 of scrum 
from each pool with 100 mI of purified type A toxin standard (50 LD,„ /ml prepared as 
described in l-l.xamplc 23b) and 500 ^1 of gcl-phosphaie. I he mixtures were incubated for M) 
min at room temperature with occasional mixing. Rach of four mice were injected IP with 
15 the mixtures (0.7 ml/mousc). fhe mice were observed for signs of botulism for 72 hours. 
Mice receiving loxin mixed with serum from mice immunized with either the pMisBoi or 
pMBoi proteins showed no signs of botulism intoxication. In contrast, mice receiving 
preimmune serum died in less than 24 hours. 

These results demonsiraic that antibodies capable of neutralizing ( hondmum tvpe A 
20 toxin arc induced when cither of the recombinant (/. boiulimim C fragment proteins pHisBot 
or pMBot are used as immunogens. 



EXAMPLE 28 

Cloning And Expression Of The C Fragment of hotulimim 
Serotype A Toxin In E. co/i Utilizing A Native Gene Eragmcnt 



In Example 22 above, a synthetic gene was used to express the C fragment of (' 
hoiulhwm serotype A toxin in £*. volL fhe synthetic gene replaced non-preferred (/.c. rare) 
codons present in the C fragment gene with codons which are preferred by voli. I he 
synthetic gene was generated because it was been reported that genes which have a high A/T 
content (such as most clostridial genes) creates expression difficulties in £ toli and yeast. 
Furthermore. LaPenoiierc et al. suggested that problems encountered with the stability (non- 
fusion constructs) and solubility (MBP fusion constructs) of the C fragment of C. botulimtm 
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serotype A toxin when expressed in £. coli was most likely due to the extreme A/T richness 
or the native C. boiulinum serotype A toxin gene sequences ( LaPenotiere. el ai. .supra). 

In this example, it was demonstrated that successful expression of the C fragment of 
( •. hnndinum type A toxin gene in £ coll docs not require the elimination of rare codons 
U.C.. there is no need to use a synthetic gene). This example involved a) the cloninu of the 
native C fragment of the C. serotype A toxin gene and construction of an^ 

expression vector a.id b) a comparison of the expression and purif.cation vieids of C 
h,m,nmm, serotype A C fragments derived from native and synthetic expression vectors. 

a) Cloning Of The Native C Fragment Of The C. hotulinum 

Serotype A Toxin Gene And Construction Of An Expression 
Vector 

The serotype A toxin gene was cloned from C. bniulinim genomic DNA using 1>CR 
amplillcation. The following primer pair was employed: 5'-CGCCATGGCTAG 
ATTATTATCTACATTTA(;-.r (.V primer. Nco\ site underlined: SEQ ID NO;29) and 
5 -C;(-AMirriCTTC;ACAGACTCAT(JTAG-3- (.V primer. Hind\\\ site underlined: SEQ ID 
N():3()). ( -. hoiulinum t) pe A strain was obtained from the American l ype Culture Collection 
(A I ( ( #iy.-;97) and grown under anaerobic conditions in Terrific broth medium. High 
molecular-weight C homlinum DNA was isolated as described in f-i.xample 1 1. The integrity 

yield oi genomic DNA was assessed by comparison with a serial dilution of uncut lambda 
DNA after electrophoresis on an agarose gel. 

The gene fragment was cloned by ITR utilizing a proofreading thermostable DNA 
polymerase (native /'/„ polymerase). PCR amplification was performed using the above 
primer pair in a 50^1 reaction containing lOmM Tris-HCl (pH 8.3). 50mM KCI. l.5mM 
MgCI, 2(J();,M each dNTP. 0.2mM each primer, and 50ng C". huulmum genomic DNA. 
Reactions were overlaid with 100^1 mineral oil. heated to 94"C 4 min. 0.5/xl native Pfu 
polymerase (Stratagcne) was added, and thirty cycles comprising 94T for I min, SO'C for 2 
■nin. 72-C for 2 min were carried out followed by 10 min at 72-C. An aliquot (10^1) of the 
reaction mixture wa.s resolved on an agarose gel and the amplified native C fragment gene 
was gel purified using the Prep-A-Ciene kit (BioRad) and ligatcd to pCRScript vector DNA 
(Stratagene). Recombinant clones were isolated and confirmed by restriction digestion, using 
standard recombinant molecular biology techniques |Sambrook el al. (1989). .supra]. In 
addition, the .sequence of approximately .100 bases located at the 5' end of the C fragment 
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coding region were obtained using standard DNA sequencing methods. The sequence 
obtained was identical to thai of the published sequence. 

An expression vector containing the native C, hotulimim serotype A C fragment gene 
was created by ligation of the NcoUllmdXW fragment containing the C fragment gene from the 
5 pCRScript clone lo Nhd-IIinAWX restricted pETHisa vector (H.xample 18b). The Nco\ and 
;V77c'i sites were lllled in using the Klenow enzyme prior to ligation: these sites were thus 
blunt-end ligated together, fhe resulting construct was termed pHisBotA (native). pHisBolA 
(native) expresses the C*. hotulimmi serotype A C fragment with a his- lagged N terminal 
extension which has the following sequence: 

10 Met(ilyHisHisllisHisllisllisHisHisHisHisSerSerGlvHis//L<;//^^^^ (SEQ ID 

N():24), where the underiining represents amino acids encoded by the (\ hoiulinum C 
fragment tiene (this N lerminal extension contains the recognition site for l-aciorXa proiease, 
shown in italics, which can be employed lo removed ihe polyhistdine tract from ihe N- 
terminus of the fusion protein). I he pHisBol (native) construct expresses the idenlical protein 

15 as the pI lisBol construct (Ex. 24c: herein after the pHisBotA) which contains the synthetic 
gene. 

The predicted DNA sequence encoding the native (' hoiuHnum serotype A C fragment 
gene contained within pHisBotA (native) is lisied in Sl£0 ID NOiT^l (the start of translation 
{AT(i) is located ai nucleotides 108-1 10 and the stop of translation (TAA) is located at 
20 nucleotides 1 494- 1496 in SEQ ID N():3I1 and the corresponding amino acid sequence is listed 
in SF:Q ID NC):26 [i.e., the same amino acid sequence as that produced by pHisBotA 
containing synthetic gene sequences). 

b) Comparison Of The Expression And Purificatiiin Yields Of 
25 C hotulimim Serotype A C Fragments i>crivcd From Native 

And Synthetic Expression Vectors 

Recombinant plasmids containing cither the native or the synthetic ( hnfniinum 
serotype A C* fragment genes were transformed into /:. coli strain BI21(DE3) pLysS and 
protein expression was induced in 1 liter shaker llask cultures. Total protein extracts were 
30 isolated, resolved on SDS-PAGE gels and C. honilinum C fragment protein was identified by 
Western analysis utilizing a chicken anti-C* ho/ulinum serotype A toxoid antiserum as 
described in Example 22. 
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Briefly. 1 liier (2XYT + 100 ^g/ml ampicillin and 34 pg/ml chloramphenicol) cultures 
of bacteria harboring either the pHisBoiA (synlhetic) or pHisBolA (native) plasmids in the 
BIZUDEj) pl.ysS strain were induced to express recombinant protein by addition of IPTG to 
ImM. Cultures were grown at 30-32°C. IPTG was added when the cell density reached an 
()D^.,H, 0,5-1.0 and ihc induced protein was allowed to accumulate for 3-4 hrs after induction. 

The cells were cooled for 15 min in a ice water bath and then cenlrifuged for 10 min 
at 5000 rpm in a JAIO rotor (Beckman) al 4°C. The cell pellets were resuspended in a total 
volume of 40 mis IX binding buffer (40 mM imidazole. 0.5 M NaCI. 50 mM NaPO,. pM 
8.0). transferred to two 50 ml Oakridge lubes and frozen at -70**C for at least 1 hr. The tubes 
were then thawed and the cells were lyscd by sonication (using four successive 20 second 
bursts) on ice. The suspension was clarilled by centrifugation 20-30 min at 9,000 rpm 
(lO.OOO.if) in a JA-17 rotor. The soluble lysale was batch absorbed to 7 ml of a 1:1 slurry of 
NiNTA resin:binding buffer by stirring 2-4 hr at 4^C. The slurry was cenlrifuged for I min 
at 5()0.t,' in 50 ml lube (Falcon), resuspended in 5 mis binding buffer and poured inlo a 2.5 cm 
diameter column (BioRad). The column was aiiachcd lo a UV monitor (ISCO) and the 
column wiis washed wiih binding buffer until a baseline was established. Imidazole was 
removed by washing vviih 50mM NaPO,. 0.3 M NaCl. 10% glycerol. pH 7.0 and bound 
protein was eluicd using 50mM NaPOj. 0.3 M NaCI. 10% glycerol. pH 3.5-4.0. 

fhe eluled proteins were stored at 4°C. Samples of total, soluble, and eluied proteins 
were resolved by SDS-PAGU. Protein samples were prepared for electrophoresis by mi.xing 
Uil total (T) or soluble (S) protein with 4 yi\ PBS and 5 i-il 2X SDS-PAGI- sample buffer, or 
5 Ml eluied (C) protein and 5 pi 2X SDS-PAGE sample buffer, fhe .samples were heated to 
^)yC lor 5 min. then cooled and 5 or 10 fi\s were loaded on 12.5% SDS-PAGE gels. Broad 
range molecular weight protein markers (BioRad) were also loaded to allow the MW of the 
identified fusion proteins to be esiimaied. After electrophoresis, protein was delected either 
generally by staining gels with Coomassie blue, or specifically, by blotting lo nitrocellulose 
for Western blot detection of specific immunoreactive protein. 

1 or Western bloi analysis, the gels were blotted, and protein transfer was confirmed by 
Ponceau S staining as described in lixample 22. After blocking the blots for I hr at room 
temperature in blocking buffer (PBST and 5% milk), 10 ml of a 1/500 dilution of an ami-C. 
honilhmm toxin A IgY PEG prep (Ex. 3) in blocking buffer was added and the blots were 
incubated for an additional hour at room temperature. The blots were wa.shed and developed 
using a rabbit ami-chicken alkaline phosphatase conjugate (Boehringer Mannheim) as the 
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secondary antibody as described in Ex. 22. This analysis detected C homlinum toxin A- 
reactive proteins in the plIisBotA (native and synihetic) protein samples (corresponding lo the 
predicted full length proteins identiHed by Coomassie staining). 

A gel containing proteins expressed from the pHisBoi and pHisBot (native) constructs 
during various stages of purification and stained with Coomassie blue is shown in Figure 31. 
In Figure 31. lanes 1-4 and 9 contain proteins expressed by the pHisBotA construct (U.. the 
synihetic gene) and lanes 5-8 contain proteins expressed by the pHisBotA (native) construct. 
Lanes I and 5 contain total protein extracts; lanes 2 and 6 contain soluble protein extracts; 
lanes 3 and 7 contain proteins which flowed through the NiNTA columns; lanes 4. 8 and 0 
contain protein eluted from the NiN TA columns and lane 10 contains molecular weight 
markers. 

The above purification resulted in a yield of 3 mg (native gene) or 1 1 mg (.synthetic 
gene) of affinity purified protein from a I liter starting culture, of which at least 90-95% of 
the protein was a single band of the predicted MW (50kd) and immunoreactivity for 
recombinant C. haiulimim serotype A C fragment protein. Other than the level of expression, 
no dilTerence was observed between the native and the synthetic gene expression .systems. 

These results demonstrate that soluble C. hotulimim .serotype A C fragment protein can 
be expressed in li coli and purified utilizing either native or .synthetic gene sequences. 

EXAMPLE 29 

Generation Of Neutralizing Antibodies Using A Recombinant 
C. hnitilimim Serotype A C Fragment Protein Containing A Six Residue Ilis-Tag 

111 Lxample 27. neutralizing antibodies were generated utilizing the pHisBotA protein, 
which contains a histidine-tagged N-tcrminal extension comprising 10 histidine residues. To 
determine if the generation of neutralizing antibodies is dependent on the presence of this 
particular his-tag. a protein containing a shorter N-terminal extension (comprising 6 histidine 
residues) was produced and tested for the ability to generate neutralizing antibodies. I his 
example involved a) the cloning and expression of the p6HisBotA(syn) protein and b) the 
generation and characterization ol' hyperimmune serum. 
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a) Cloning And Expression Of The p6HisBotA(syn) Protein 

The p6HisBotA(syn) construct was generated as described below: the term "syn" 
designates the presence of synthetic gene sequences. This construct expresses the C frgament 
of tlie hoiulinum serotype A toxin with a histidine-tagged N terminal extension having the 
following sequence: MetHisHisHisHisHisHisMslAia (SKQ H) NO:32): the amino acids 
encoded by the botulinal C fragment gene are underlined and the vector encoded amino acids 
are presented in plain type. 

6XHis oligonucleotides {5--TATGCATCACCATCACCA rCA-.r (SEQ ID NO:33) and 
5--CATGTGATGGTGATGGTGATC,CA-r (SEQ ID NO:34) were annealed as Ibllows. One 
microgram of each oligonucleotide was mixed in total of 20 mI IX reaction buffer 2 fNEB) 
and the mixture was heated at 70°C for 5 min and then incubated at 42°C for 5 min. The 
annealed oligonucleotides were then ligated wiih gel purified NtlcUHindW cleaved pET23b 
(T7 promoter) or pET21b (T7lac promoter) DNA and the uel purified NcoUflindlU C 
hontlinimi serotype A C fragment synthetic gene fragment derived from pAltcrBot (Ex. 22). 
Recombinant clones were isolated and confirmed by restriction digestion. The DNA sequence 
encoding the 6X his-tagged BotA protein contained within p6HisBotA(syn) is listed in SEQ 
ID NO:.35. I he amino acid sequence of the p6XHisBotA protein is listed in SEQ ID N():36. 

The resulting recombinant p6XllisBolA plasmid was transformed into the UI.2I(DE3) 
pLysS .strain, and I liter cultures were grown, induced and harvested as described in Example 
28. I lis-taggcd protein was purified as described in Example 28. with the following 
modifications. The binding buffer (BB) contained 5 mM imidazole rather than 40 mM 
imidazole and NP4() was added to tiie soluble lysate to a final concentration of 0.1%. The 
bound material was wa.shed on the column with BB until the baseline was established, then 
the column was washed successively with BB+20 mM imidazole and BB+40 niM imidazole. 
The column was eluted as described in Example 28. 

In the case of the pET23-derived expression .system, high level expres.sion of insoluble 
6HisBotA protein was induced. The pET2|.derived vector c.xpres.sed lower levels of soluble 
protein that bound the NiNTA resin and eluted in the 40 mM imidazole wash rather than 
during the low pH elution. These results (i.e.. low level expression ol' a .soluble protein) arc 
consistent with the results obtained with pHisBotA protein (Ex. 25): the pHisBotA consiruct. 
like the pET2 1 -derived vector, contains the T7lac rather than T7 promoter. 

The 6HisBotA protein thus elutes under less stringent conditions than the lOX 
histidine-eontaining pHisBot protein « 100-200 mM imidazxjle: Ex. 25) presumably due to the 
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reduction in the length of the his-iag. The eluted protein was of the predicted size [Le,. 
slightly reduced in comparison to pHisBotA protein]. 

b) Generation And Characterization Of Hyperimmune Serum 

Eight BALBc mice were immunized with purified 6HisBotA protein using Gerbu 
GMDP adjuvant (CC Biotech). The 40 mM imidazole eluiion was mixed with Gerbu 
adjuvant and used to immunize mice. Each mouse received a subcutaneous injection of 100 

antigen/adjuvant mix (12 jig antigen + 1 ^g adjuvant) on day 0. Mice were subcuianeously 
boosted as above on day 14 and bled on day 28. Control mice received pIlisBotB protein 

10 (prepared as described in Ex. 35 below) in Gerbu adjuvant. 

Anii-C. hnndinum serotype A toxoid liters were determined in serum from individual 
mice from each group using the ELISA described in Example 23u wiih the exception that the 
initial testing scrum dilution was 1:100 in blocking buffer conlainini; 0.5% Tween 20. 
followed by serial 5-fold dilutions into this buffer, fhc results of the EI. ISA demonstrated 

15 that seroconversion (relative to control mice) occurred in all 8 mice. 

Ihe ability of the anii-C. hotuUmoYt serotype A C fragment antibodies present in scrum 
Irom the immunized mice to neutralize native C* hotulmum type A toxin was tested using the 
mouse neutralization assay described in Example 23b. The amount of neutralizing antibodies 
present in the serum of the immunized mice was determined using scrum antibody titrations. 

20 The various serum dihitions (0.01 ml) were mixed with 5 LD<„ units of (\ botuhnum type A 
toxin and the mixtures were injected IP into mice, fhc neutralizations were performed in 
duplicate. 1'he mice were then observed for signs of botulism for 4 days. Undiluted serum 
was lound to protect 100% of the injected mice while the 1:10 diluted serum did not. This 
corresponds to a neutralization titer of 0.05-0.5 ID/ml. 

-5 fhese results demonstrate that neutralizing antibodies were induced when the 

6HisBoiA protein was utilized as the immunogen. Furthermore, these results demonstrate that 
seroconversion and the generation of neutralizing antibodies does not depend on the specific 
N terminal extension present on the recombinant C. hoiulmiim type A C fragment proteins. 
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EXAMPLE 30 

Construction Of Vectors For The Expression Of His-Tagged 
C. holulinum Type A Toxin C Fragment Protein Using the Synthetic Gene 

A number of expression vectors were constructed which contained the synthetic C. 
holulinum type A toxin C fragment gene. These constructs vary as to the promoter (Tl or 
T7lac) and repressor elements (laclq) present on the plasmid. Tiie T7 promoter is a stronger 
promoter than is the T7lac promoter. The various constructs provide varying expression 
levels and varying levels of plasmid stability. This example involved a) the construction of 
expression vectors containing the synthetic C holulinum type A C fragment gene and b) the 
determination of the expression level achieved using plasmids containing either the kanamycin 
resi.siancc or the ampicillin resistance genes in small scale cultures. 

a) Construction Of Expression Vectors Containing The 

.Synthetic C. botulinum Type A C Fragment Gene 
I:xprcs.sion vectors containing the .synthetic C. holulinum lypc A C fragment gene were 
engineered to utilize the kanamycin resistance rather than the ampicillin resistance gene. This 
was done for several reasons including concerns regarding the presence of residual ampicillin 
in recombinant protein derived from plasmids containing the ampicillin resistance gene. In 
addition, ampicillin resistant plasmids are more difficult to maintain in culture: the (5- 
lactamase .secreted by cells containing ampicillin resistant plasmids rapidly degrades 
extracellular ampicillin. allowing the growth of plasmid-negative cells. 

A second altered feature of the expressi(»n vectors is the inclusion of laclq gene in the 
plasmid. This repressor lowers expression from lac regulated promoters (the chromosomally 
located, lactose regulated T7 polymerase gene and the pla.smid located T7lac promoter). This 
down regulates uninduced protein expression and can enhance the stability of recombinant cell 
lines. The final alteration to the vectors is the inclusion of either the T7 or T7lac promoters 
that dri\'e high or moderate level expression of recombinant protein, respectively. 

■fhe expression plasmids were constructed as follows. In all cases, the protein 
expressed is the pHisBotA(syn) protein previously described, and the only differences between 
constructs is the alteration of the various regulatory elements described above. 
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i) Construction Of pHisBotA(syn) kan T7iac 

The pHisBotA(syn) kan T71ac construct was made by inserting ihe SapVXhol fragment 
containing the C hotulimm type A C fragment from pHisBotA(syn) inio pET24 digested with 
Sapl/Xhol (Novagen: fragment contains kan gene and origin of replication). The desired 
construct was selected for kanamycin resistance and confirmed by restriction digestion. 

ii) Construction Of pHisBotA(syn) kan laclq T7Iac 

The pHisBotA(syn) kan laclq T7lac construct was made by inserting the Xhal/Hindlll 
fragment containing the C boiulmum type A C fragment from pHisBolA(syn)kan r7lac into 
the pF.T24a vector digested with XbuVHinAlW. The resulting construct was confirmed by 
restriction digestion. 

iii) Construction Of pHisBotA(syn) kan laclq T7 

The pl IisBotA(syn) kan laclq T7 construct was made by inserting the Xha\/Hind\\\ 
fragment containing the hondimmi type A C fragment from pHisBoiA(syn) kan laclq T7Iac 
into A7K/l/////7dlll-digestcd pHisBotB(syn) kan laclq T7 (described in lix 37c below). The 
resulting construct was confirmed by restriction digestion. 

b) Determination Of The Expression Level Achieved Using 

Plasmids Containing Either The Kanamycin Resistance Or 
The Ampicillin Resistance Genes In Small Scale Cultures 

One liter cultures of pHisBolA(syn) kan T7]ac/BI21(T)E3)pl.ysS and pHisBoiA(syn) 
amp T71ac/BI2l(DE3)pLysS (this is the previously designated pHisBotACsyn) construct) were 
grown, induced and his-lagged proteins were purified as described in Example 28. No 
differences in yield or protein integrity/purity were observed. 

These results demonstrate that the antigen induction kvels from expression constructs 
were not affected by the choice of ampicillin versus kanamycin antibiotic resistance genes. 
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EXAMPLE 31 

Fermentation Of Cells Expressing Recombinant Botulinal Proteins 

a) Fermentation Culture Of Cells Expressing Recombinant 
3 Botulinal Proteins 

I'tTmcntaiion cultures were grown under the following conditions which were 
optimized for growth of the BL21{DE3) strains containing pET derived expression vectors. 
An overnight I liter feeder culture was prepared by inoculating of I liter media (in a 2L 
shaker tlask) with a tVesh colony grown on an LB kan plate. The feeder culture contained: 

10 600 mis nitrogen source [20 gm yeast extract (BBL) and 40 gm irypione (BBL)/600 misj, 200 
mis 5X fermentation salts (per liter: 48.5 gm K>HPO,, 12 gm NaH^POj'H^O. 5 gm NH.Cl, 
2.5 gm NaCl), 180 mis dH.O. 20 mis 20% glucose, 2 mis i M MgSO,, 5 mis 0.05M CaCK 
and 4 mis of a 10 mg/ml kanamycin stock. All solutions were sterilized by auioclaving. 
except the kanamycin stock which was filter sterilized. 

An aliquot (5 ml) of the feeder culture broth was removed prior to inoculation, and 
grown ibr 2 days at 3TC as a culture broth sterility control. Growth was not observed in this 
control culture in any of the fermentations performed. 

The inoculated feeder culture was grown for 12-15 hrs (ON) at 30-37°C. Care was 
taken to prevent oversaturaiion of this culture. The saturated feeder culture was added to lOL 

20 of fermentation media in fermenter (BiofloIV. New Brunswick Scientillc. l-dison. NJ) as 
follows, fhe fermenter was sterilized 120 min at 12rC with dH,0. The sterile water was 
rcmoxcd. and fermentation media added as follows: 6 liters nitrogen source. 2 liters 5X 
fermeniaiion salts, 2 liters 2% glucose. 20 mis 1 M MgSO,, 50 mis 0.05 M CaCU. 2.5-3.5 
mis iVIacol P 400 anlifoam (PPG Industries inc.. Gurnee. ID, 40 mis lOmg/ml kanamycin and 

25 10 mis trace elements (8 gm FeS0/7H_,0. 2 gm MnSO,»HX). 2 gm A1CI,*6H,0. 0.8 gm 
CoCNGU.O. 0.4 gm ZnS0,»7H>0. 0.4 gm Na.MoO/2H30. 0.2 gm CuCU»2H,0. 0.2 gm 
NiCI,. 0.1 gm H.iBOy200mls 5 M HCI). All solutions were sterilized by autoclaving. except 
the kanamycin slock which was filler sterilized. Fermentation media was prewarmed to ST^'C 
before the addition of the feeder culture. 

'^^^ After the addition of the feeder culture, the culture was fermented at ?y7X\ 400 rpm 

agitation, and 10 1/min air sparging. The DO, control was set to 20% PID and dissolved 
o.xygen levels were controlled by increasing the rate of agitation from 400-850 rpm under 
IKK control. D0> levels were maintained at greater than or equal to 20% throughout the 
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entire fermentation. When agitation levels reached 500-600 rpm the temperature was lowered 
to SOX to reduce the oxygen consumption rate. Culture growth was continued until 
endogenous carbon sources were depleted. In these fermentations, glucose was depleted first 
[monitored with a glucose monitoring kit (Sigma)], followed by assimilation of acetate and 
other acidic carbons [monitored using an acetate test kit (Boehringcr Mannheim)). During the 
assimilation phase, the pH rose from 6.6-6.8 (starting pH) to 7.4-7.5. at which time the bulk 
of the remaining carbon source was depleted. This was signaled by a drop in agitation rate 
(from a maximum of 700-800 rpm) and a ri.se in DO, levels >3()%. This corresponds to a 
0D„„ reading of 18-20/ml. At this poim a fed batch mode was initiated, in which a feed 
solution of 50% gluco.sc was added at a rate of approximately 4 gm gluco.se/liter/hr. The pH 
was adjusted to 7.0 by the addition ol" 25% H;P04 (approximately 60 mis). Culture growth 
was continued and reached peak oxygen consumption within ihc next 3 lirs of growth (while 
the remaining residual non-glucose carbon sources were a.ssimilatcd). This phase is 
characterized by a slow increase in pi I. and air sparging was increased to 15L/min. to keep 
the maximum rpm below 850. Once the residual acidic carbon sources are depleted the 
agitation rate decrea.ses to 650-750 rpm and the pH begins to drop. pH control w-as 
maintained at 7.0 PID by regulated pump addition of a sterile 4M NaOI l solution which was 
consumed at a steady rale for the remainder of the fermentation. Growih was continued at 
30°C. and the cultures were grown linearly at a growth rate of 4-7 ()b«„ unius/hr. to at least 
81.5 ()D,.„„ units/ml (>30g/l dry cell weight) without induction. Antifoam (a 1:1 dilution with 
filler sterilized 100% ethanol) was added as necessary throughout the fermentation to prevent 
foaming. 

During the fed batch mode, glucose was assimilated immediately (concentration in 
media consistently less than 0. 1 gm/liter) and acetate was not produced in significant levels by 
the pET plasmid/BL21{DE3) cell lines tested (approximately 1 gm/Iiter at end of 
fermentation: this is lower ilian that observed in harvests from shaker flask cultures utilizing 
the same strains). This was fortuitous, since high levels of acetate has been shown to inhibit 
induction levels in a variety of expression systems. The above described conditions were 
found to be highly reproducible between fermentations and utilizing difTcrcnt expression 
plasmids. As a result, glucose and acetate level monitoring were no longer preformed during 
fermentation. 
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b) Induction Of Fermentation Cultures 

Induction with IPTG (250 mg-IO gms, depending on the expression vector and 
experiment) was initiated 1-3 hrs after initiation of the glucose feed (30-50 (3D«H/ml). The 
growth rale after induction was monitored on a hourly basis. Aliquots (5-10 ml) of cells were 
harvested at the time of induction, and at hourly intervals post-induction. Optical density 
readings were determined by measuring the absorbance at 600 nm of 10 ^1 culture in 990 ix\ 
PBS versus a PBS control. The growth rate after induction was found to vary depending on 
the expression system utilized. 

c) Monitoring Of Fermentation Cultures 

i-ermentation cultures were monitored using the following control assays. 

i) Colony Forming Ability 

An aliquots of cells were removed from the cultures at each timepoint sampled 
(uninduced and al various times after induction) were serially diluted in PBS (dilution 1 = 15 
Ml cells/3 ml PBS. dilution 2 - 15 of dilution 1/3 ml PBS. dilution 3 - 3 or 6 |.il of 
dilution 2/3mls PBS) and 100 mI of dilution 3 was plated on an LB or TSA (trypiicase soy 
agar) plate. The plates were incubated ON al 37°C and then the colonies are counted and 
scored for macro or micro growth. 

ii) Phenotypic Characterization 

Colonics growing on LB or ISA plates (above) from uninduced and induced 
timepoinis were replica plated onto LB+kan, LB+chioramphenicol (for fermentations utilizing 
LysS or pACYCGro plasmids). LB+kan+ImM IPTG and LB plates, in this order. The plates 
were grown 6-8 hrs al 37°C and growth was scored on each plate for a minimum of 40-50 
well isolated colonies. The percentage of cells retaining the plasmid at time of induction (i.e., 
uninduced cultures immediately prior to the addition of IPTG) was determined to be the # 
colonies LB+Kan (or chloramphenicol) plate/# colonies LB plate X 100%. The percentage of 
cells with mutated pET plasmids was determined to be the H colonies LB+Kan+IPTG plate/# 
co/onies LB plate X 100%, Colonies on all LB plates were .scored morphologically for £. 
coli phenotype as a contamination control. Morphologically detectable contaminant colonies 
were not detected in any fermentation, 
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Hi) Recombinant BotA Protein Induction 

A total of 10 OD^ units of cells (e.g.. 200 m1 of cells at OD«„=50/ml) were removed 
from each timepoint sample to a 1.5 ml microfugc tube and pelleted for 2 min at maximum 
rpm in a microfuyc. The pellets were rcsuspendcd in I ml of 50 mM NallPO.. 0.5 M NaCl. 
40niM imidazole buffer (pH 6.8) containing I mg/ml lysozyme. The samples ucre incubated 
for 20 min at room temperature and stored ON at -70«>C. Samples were thavyed completely at 
room temperature and sonicated 2X10 seconds with a Branson .Sonitlcr 450 microtip probe 
at # 3 power setting. The samples were ccntrifuged for 5 min at maximum rpm in a 
microfuge. 

An aliquot (20 of the protein samples were removed to 20 mI 2X .sample buffer, 
before or after cenlrifugation. for total and soluble protein extracts, respectively, fiie samples 
were heated to y5°C for 5 min. then cooled and 5 or 10 ^li were loaded onto 12.5% 
SDS-I>A(JE gels. High molecular weight protein markers (BioRad) were also loaded to allow 
lor estimation of the MW of identitled fusion proteins. After electrophoresis, protein was 
15 detected either generally by staining gels with Coomassie blue, or spccitlcally. by blotting 
onto nitroccllulo.se (us described in Lx. 28) for Western blot detection of specific his-taggcd 
proteins utilizing a NiN l A-alkaline phosphatase conjugate exactly us described by the 
manufacturer (Qiagen). 

iv) Recombinant Antigen Purification 

At tiic end of each fermentation run. 1-10 liters of culture were harvested from the 
lermenter and the bacterial cells were pelleted by centrifugalion at 6000 rpm for 10 min in a 
•lAlO rotor (ficckman). The cell pellets were stored frozen at -70%" or utilized immediately 
without freezing. Cell pellets were resuspcnded to 15-20% weight to volume in resu.spension 
buffer (generally 50 mM NaPO,. 0.5 M NaCI. 40mM imidazole. pH 6.8) atid ly.scd utilizing 
either sonication or high pressure homogenization. 

I'or st)nication. the resuspcnsion buffer was supplemented with ly.sozyme to I mg/ml. 
and the suspension was incubated for 20 min. at room temp. I he sample was then frozen ON 
111 -70"»(". thawed and sonicated 4 X 20 .seconds at microtip maximum to tvduce viscosity. 
30 Tor homogenization. the cells were lyzed by 2 passes through a homogenizer (Rannie 

Mini-lab type 8.30 H) at 600 Bar. Cell lysatcs were claritied by ccntrifugation for 30 min at 
10.000 rpm in a JAIO rotor. 
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For IDA chromatography, samples were flocculated utilizing polyethyleneimine (PEI) 
prior to centrifugation. Cell pellets were resuspended in cell resuspension buffer (CRB: 50 
mM NaPOj, 0.5 M NaCI. 40 mM imidazole. pH 6.8) to create a 20% cell suspension (wet 
weight of cells/volume of CRB) and cell lysates were prepared as described above (sonication 
or homogenization). PEI (a 2% solution in dH,0, pH 7.5 with HCI) was added to the cell 
lysaie a final concentration of 0.2%. and stirred for 20 min at room temperature prior to 
centrifugation (8.500 rpm in JAIO rotor for 30 minutes at 4*'C). This treatment removed 
RNA. ONA and cell wall components, resulting in a clarified, low viscosity lysate ("PFF 
clarified lysate"). 

His-tagged proteins were purified from soluble lysates by metal-chelaic ai'finity 
chromatography using either a NiNTA resin (as described in Ex. 28) or an IDA ( iminodiacetic 
acid) resin as described below. 

IDA resin affinity purifications were performed utilizing a low pressure 
chromatography system (ISCO). A 7 ml (small scale) or 70 ml (large scale) Chelating 
Scpharoso Fast Flow (Pharmacia) affinity column was poured: in addition, a second guard 
column was poured and attached in line with the first column (to capture Ni ions that leached 
off the affinity column). The columns were washed with 3 column volumes of dIKO. The 
guard column was then removed and the affinity column was washed with 0.3 M NiSOj until 
resistivity was established, then with dH,0 until the resistivity returned to baseline. The 
columns were reconnected and equilibrated with cell resuspension buffer (CRB: 50 niM 
NaPOj. 0.5 M NaCI. 40 mM imidazole. pH 6.8). The clarified sample (in CRB) was loaded. 
Flow rates were 5 ml/min for small .scale columns and 20 ml/min for large scale columns. 
After sample loading, the column was washed with CRB until a baseline established and 
hound protein was eluted with elution buffer (50 mM NaPOj. 0.5 M NaCI. 800 mM 
imidazole. 20% glycerol. pH 6.8 or 8.0). Protein samples were stored at 4''C or -20''C. The 
yield of eluted protein was established by measuring the OD.^n of the elutions. with a I mg/ml 
solution of protein assumed to yield an absorbancc reading of 2.0. 

fhe IDA columns may be regenerated and reused multiple limes (>I0). To regenerate 
the column, the column was washed with 2-3 column volumes of I KO. then 0.05 M EDTA 
until all of the blue/green color was removed followed by a wash with dH,0. The IDA 
columns were sterilized with 0.1 M NaOH (using at least 3 column volumes but not more 
than 50 minutes contact lime wiih column packing material), then washed with 3 column 
volumes 0.05 M NaPO^. pH 5.0. then dU,0 and stored at room temperature in 20 % ethanol. 
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EXAMPLE 32 

Construction Of A Folding Chaperonc Ovcrexpression System 

Co-ovcrexpression of the £ coli GroEL/GronS folding chaperones in a cell expressing 
a recombinant foreign protein has been reported lo enhance the solubility oi' some foreign 
proteins that are otherwise insoluble when expressed in E, coli [Gragcrouu et al. ( 1 992) Proc. 
Natl. Acad. Sci. USA 89:10344]. The improvement in solubility is thought to be due to 
chaperone-medialed binding and unfolding of insoluble donaiured proteins, thus allowing 
multiple attempts for productive refolding of recombinant proteins. By overexpressing the 
chaperones, the unfolding/refolding reaction is driven by excess chaperone, resulting, in some 
cases, in higher yields of soluble protein. 

In this example, a chaperone overexpression system, eompatibic with pHT vector 
expression systems, was constructed to facilitate testing chaperone-mcdiated solubilization of 
C hofnlhwm type A proteins. This example involved the cloning of the GroHL/KS operon 
and construction of a pl.ysS-based chaperonc hypercxprcssion system. 

I he GroEL/GroRS operon was PCR amplified and cloned into the pCRScript vector as 
described in lixamplc 28. The following primer pair was used: 5*-C'GCAT 
ATCAATA ITCGTCCATTGCATGO- (SEQ ID NO:37) [5' primer, start codon of groES 
gene converted to NJe\ site (underlined)] and 5 -GG AACrCTT GCAGGGrAAr T/ICATCATG 
(SEQ ID NO:38) (3' primer, stop codon of groEL gene italicized, engineered HindlU site 
underlined). Following amplification, the chaperone operon was excised as an AV/t'I////«dlII 
fragment and cloned into pET23b digested with NcM and //mdlll. This construction places 
the Gro operon under the control of the T7 promoter of the pET23 vector. The desired 
construct was confirmed by restriction digestion. 

1 he f 7 promoter-Gro operon-T7 terminator expression ca.ssette was then exei.sed as a 
D^IWBxpEl (filled) fragment and cloned into Bamlil (compatible with lif^lU)/ HindlU (filled) 
cleaved pLysS plasmid (this removed the T7 lyso?.yme gene). The resulting construct was 
designated pACYCGro. since the plasmid utilizing the pACYCI84 origin from the ply.sS 
plasmid. Proper construction was confirmed by restriction digestion. 

pACYCGro was transformed into BL2I(DE3). cultures were grown and induced with 
I mM IPTG as described in preceding examples. Total and soluble protein extracts were 
generated from cells removed before and after IPTG induction and were resolved on a 12.5 % 
SDS-PAGE gel and stained with Coomassie blue. This analysis revealed that high levels of 
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soluble GroEl and GroES proteins were made in the induced cells. These results 
demonstrated that ihe chaperone hypcr-cxpression system was functional. 

EXAMPLE 33 

■ 5 CJrowih Of BolA/pACYCGro Cell Lines In Fermentalion Cultures 

Induction of BL2UDE3) cells lacking Ihe LysS plasmid which contained BotA 
expression constructs grown in shaker tlask or fermentation culture resulted in the expression 
of primarily insoluble BolA protein. l-ermeniation cultures were performed lo determine if 

10 the simultaneous overexpression of the Gro operon and recombinant C hoiulimim type A 
proteins (BotA proieins) resulted in enhanced solubility of the rccombinani RolA protein. 
This example involved the fermentation of pHisBotA(syn)kan laclq ITbc/pACYCGro 
BL2l(i:>n3l and pHisBotA(syn)kan laclq H/pACYCGro BIJ](DE3) cell lines. The 
fermentations were repeated exactly as described in Example 31. Chloramphenicol (34 

15 i-ig/ml) was included in the feeder and fermentation cultures. 

a) Fermentation Of pHisBotA(syn)kan laclq TThic/pACVCGro 
BL2I(DE3) Cells 

I'or fermentation of cells containing plasmids comprising the T7lac promoter. 

20 induciion was with 2 gms [PTG at 1 hr post initiation of glucose feed. The OD,.„nVvas 35 at 
time of iiKluciion. then 48,5, 61.5. 67 at 1-3 hrs post induction. Viable colony counts 
decreased fmm 0-3 hr induciion (21 (I3K 0. 0, 0: dilution 3 utilized 3 |al of dilution 2 cells] 
with numbers in parenthesis for the indicating microcolonies. 01* 28 colonies scored at the 
lime of induction, 23 retained the pHisBoiA(syn)kan laclq T7lac plasmid (kan resistant), 22 

25 contained the chaperone plasmid (chloramphenicol resistant) and no colonies at induction 
grew on IPTG+Kan plates (no mulations detected). These results were indicative of very 
strong promoter induction, since colony viability dropped inmiediaiely after induciion. 

Total and soluble extracts were resolved on a 12.5% SHS-PAGE gel and stained wiih 
Coomassie. High level induction of Gro chaperones was obsen^ed, but very low level 

30 expression of soluble BotA protein was observed, increasing from I to 4.0 hrs post induciion 
(no expression detected in unindueed cells). The dramatically lower expression of the BotA 
antigen in the presence of chaperone may be due to promoter occlusion {i.e. the stronger T? 
promoter on the chaperone plasmid is preferentially utilized). 
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b) Fermentation Of pHisBotA(syn)kan laclq Til pACYCGro 
BL21(DE3) Cells 

A fermcntaiion utilizing the T7-driven BotA expression plasmid was performed. 
Induction was with I gm IPTG at 2 hrs post initiation of glucose feed. The OD,,, was 4 1 at 
time of induction, then 51.5. 61.5. 61.5 and 66 at 1-4 hrs post induction. Viable colony 
counts decreased from (M hrs induction [71. I (34). 1(1). ]. (): dilution 3 utilized 6 ni 
dilution 2 cells) with numbers in parenthesis for the uninduccd timepoini indicating 
microcolonies. Of 65 colonies scored at the time of induction, all 65 retained both the 
pHisBotA(s\'n)kan laclq T7 plasmid (kan resistant) and the chaperonc plasmid 
(chloramphenicol resistant) and no colonies at induction grew on IPTG+Kan plates (no 
mutations detected). 

loial and soluble extracts were resolved on a 12.5% Sl)S-l»AGIi gel and stained with 
Cooma.ssic. High level induction of Oro chaperones and moderate level expression of soluble 
BotA protein was observed, increasing from I to 4.0 hrs po.st induction (no expression 
detected in uninduced cells). 

A Plil-clarilied lysate (0.2% llnal cocnentration PF.I) (850 ml from 130 gm cell pellet 
(2 liters fermentation harvest)) was purified on a large scale IDA column. A total of 78 mg 
of proteni was eluled. l-.xtracts from the purification were resolved on a 12.5% SDS-PAGK 
gel and stained with C oomassie. The eluiion was found to contain an approximately 1:1 mix 
of BolA/chaperone protein (Figure 32). PEI ly.satcs prepared in this manner were typically 16 
()IX„/nil. This was estimated to be 8 mg proicin/ml of ly.saie (by BCA a.s.say). Thus, the 
eluled recombinant BotA protein represented 0.55% of the total soluble cellular protein 
applied to the column. 

In Figure 32. lane I contains molecular weight markers, lanes 2-9 contain extracts 
from pilisBotA(syn)kan laclq I2/pACYCGro/BL2l(Dr-3) cells before or during purification 
on the IDA column. Lane 2 contains total protein extract; lane 3 contains soluble protein 
extract: lanes 4 and 5 contain PEI-clarified ly.sates (duplicates): lanes 6 and 7 contain flow- 
through from the IDA column (duplicates) and lanes « and 9 contain IDA column elute (lane 
9 contains l/IO the amount applied to lane 8). 

The.sc results demonstrate, that although the majority of the BotA protein produced 
was insoluble. 20 mg/liter of soluble recombinant BotA protein can be purified utilizing the 
pHisBotA(.syn)kan laclq I7/pAC YCGro/BL2l(DE3) expression system. 
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EXAMPLE 34 

Purification Of Recombinant BolA Protein From Folding Chaperones 

In this example (»f size exclusion chromatography was used to purify the recombinant 
Hot A protein away from the folding chaperones and imidazole present in the IDA-purified 
material (Ex. 33). 

To enhance the solubility of the recombinant BolA protein during scale-up. the protein 
was co-expressed with folding chaperones (Ex. 33). As observed with the recombinant BoiB 
protein (Uxample 40 below), the folding chaperones co-eluted with the recombinant BoiA 
protein during the Ni-IDA purification step. Because the recombinant BotA and BotB 
proteins have .similar molecular weights (about l/IO the size of the non-reduced folding 
chaperone) and the imidazole step gradient strategy was unsuccessful in purifying BotB away 
from the folding chaperonc (see llx. 40). size exclusion chromatography was examined for the 
ability to purify the recombinant BotA protein away from the lolding chaperones. 

A column (2.5 .\ 24 cm) containing Sephacryl S-100 HR (Pharmacia) was poured (bed 
volume • 1 10 ml». Proteins having molecular weights greater than 100 K are expected to elute 
in the void volume under these conditions and smaller proteins should be retained by the 
beads and elute at different times, depending on their molecular weights, fo maintain 
solubility of the purified BotA protein, the Sephacryl column was equilibrated in a buffer 
having the .same salt concentration as the buffer used to elute the BotA protein from the IDA 
column {i.e.. 50 mM sodium phosphate. 0.5 M NaCI. 10% glycerol: all reagents from 
Mallinkrodt. Chesierneld. MO). 

l ive milliliters of the IDA-purified recombinant BotA protein (Ex. 33) was filtered 
Ihrough a 0.45 fi syringe filter, applied to the column and the equilibration buffer was 
pumped through the column at a fiow rate of 1 ml/minute. Eluted proteins were monitored 
by absorbance at 280 nm and collected either manually or with a fraction collector (BioRad). 
Appropriate fractions were pooled, if necessary, and the protein was quantitated by absorbance 
at 280 nm and/or BCA protein assay (Pierce). The isolated peaks were then analyzed by 
native and/or .SDS-PAGE to identify the proteins present and to evaluate purity. The folding 
chaperonc eluted first, followed by the recombinant BolA protein and then the imidazole 
peak. 

SD.S-PAGE analysis (12.5% polyacrylamide. reduced samples) was used to evaluate 
the purity ol the IDA-purified recombinant BotA protein before and after S-lOO purification. 
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Fiyure 33 shows the difference in purity before and after the S-lOO purification step. In 
Figure 33. lane I contains molecular weight markers (BioRad broad range). Lane 2 shows 
the IDA-purified recombinant BotA protein preparation, which is contaminated with 
signiticani amounts of the folding chaperone. Following S-lOO purification, the amount of 
folding chaperone present in the BotA sample is reduced dramatically (lane 3), Lane 4 
contains no protein (i.e„ it is a blank lane): lanes 5-8 contain samples of IDA-purified 
recombinant BolB and Boti: proteins and are discussed infra, 

Kndotoxin levels in the S-lOO purified BotA preparation were determined using the 
LAL a.ssay (Associates of Cape Cod) as describe in Example 24. I lie purified BolA 
preparation was found to contain 22,7 to 45.5 EU/mg recombinant protein. 

These results demonstrate that size exclusion chromatography was successful in 
purilVing the recombinant BolA protein from folding chaperones and imidazole following an 
initial IDA purification step. Furthermore, these results demonstrate that the S-lOO purified 
BotA protein was substantially free of endotoxin. 

EXAMPLE 35 

Cloning And Expression Of The C Fragment 
Of The C\ hoiulinum Serotype B Toxin Ciene 

riic ( '. hoiulinum type B neurotoxin gene has been cloned and sequenced IWhelan ci 
ul. (1992) Appl. [jiviron. Microbiol. 58:2345 and Hutson at al, (1994) Curr. Microbiol. 
28:1011. I hc nucleotide sequence of the toxin gene derived from the i:klund 17B .strain 
(ATCC* 25765) is available from ihc EMBL/GenBank sequence data banks under the 
accession number X7I343: the nucleotide sequence of the coding region is listed in SEQ ID 
NO:39. The amino acid sequence of the C. hoiulinum type B neurotoxin derived from the 
strain Eklund 17B is listed in SEQ ID NO:40. The nucleotide sequence of the C hoiulinum 
serotype B toxin gene derived from the Danish strain is listed in SEQ ID N():4I and the 
corresponding amino acid sequence is listed in SEQ ID NO:42. 

I he DNA sequence encoding the native i\ hotulinum serotype B C fragment gene 
derived from the Eklund I7B strain can be expres.sed using the pFTHisb vector: the resulting 
coding region is listed in SF-Q If) NO:43 and the corresponding amino acid sequence is listed 
in SEQ H) NO:44. The DNA sequence encoding the native C. hoiulinum serotype B C 
fragment gene derived from the Danish strain can be expressed using the pETHisb vector; the 
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resulting coding region is listed in SEQ ID NO:45 and the corresponding amino acid sequence 
is listed in SEQ ID NO:46. The C frgamenl region from any strain of C hotulimm serotype 
B can be amplified and expressed using the approach illustrated below using the C fragment 
derived from C. hoiuUmim type B 2017 strain. 

The C. hoiulimtm type B neurotoxin gene is synthesized as a single polypeptide chain 
which is processed to form a dimer composed of a light and a heavy chain linked via 
disulfide bonds: the type B neurotoxin has been reported to exist as a mixture of predominatly 
single chain with some double chain (Whelan e/ al.. supra). The 50 kD carboxy-tcrmina! 
portion of the heavy chain is referred to as the C fragment or the H,- domain. Expression of 
the C I'ragmcnt of C. hotulinum type B toxin in heterologous hosts (e,^., E. coli) has not been 
previously reported. 

The native C fragment of the ( ' homlimtm serotype B toxin gene was cloned and 
expression constructs were made to facilitate protein expression in L, coli. This example 
involved PCR amplification of the gene, cloning, and construction ol' expression vectors. 

I he C IVagment of the C. hotulinum serotype B (BotB) toxin gene was cloned using 
the protocols and conditions described in Example 28 for the isolation of the native BotA 
gene, flic C hotulinum l\pe B 2017 strain was obtained from the American Type Culture 
Collection (A IC'C -17843). The following primer pair was used to amplify the BotB gene: 
5'-r( iC CATCKiC I'GATACAATACTAATAGAA ATCi-3* [5' primer, engineered A'cy;I site 
underlined (SLQ ID NO:47)l and 5 -C}C AAG CTl TTAlTC/KGJCC^CCCTVCKTC\r \y 
primer, engineered //mdlli site underlined, native gene termination codon italicized* (SEQ ID 
N():4X)|. Alter cloning into the pC:R.script vector, the A'AfI(niled)////;7dni fragment was 
cloned into pETMisb vector as described for BotA C fragment gene in Example 28. The 
resulting construct was termed pHisBotB. 

pHisBotB expresses the BotB gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-terminal lOXHi.s-tag affinity tag. The 
pMisBotB expression construct was transformed into BL2l(DE3) pLysS competent cells and I 
liter cultures were grown, induced and his-iagged proteins were purified utilizing a NiNTA 
resin (eluted in low pi I elution buffer) as described in Example 28. Total, .soluble and 
purified proteins were resolved by SDS-I^AGE and delected by Coomassie staining and 
Western blot hybridization utilizing a chicken anti-C. hotuUmmi .serotype B toxoid primary 
antibody (generated by immunization of hens using C, bondinum serotype B toxoid as 
described in Example 3). Samples of FiotA and BotE C fragment proteins were included on 
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the gels lor MW and immunogeniciiy comparisons. Strong immunoreactiviiy to only the 
BotB protein was detected with the anti-C. hotulinum serotype B toxoid antibodies. The 
recombinant BotB protein was expressed at low levels (3 mg/liter) as a soluble protein. The 
purilied BotB protein migrated as a single band of the predicted MW {i.e.. -^QkD). 

These results demonstrate the cloning of the native ( : hoivlinum .serotype B C 
fragment gene, the expression and puritieation of the recombinant BotB protein as a .soluble 
his-tagged protein in £. coli. 



20 



25 



30 



EXAMPLE 36 

Generation Of Neutralizing Antibodies Using The Recombinant pHisBotB Protein 

The ability of the purified pHisBot protein to generate neutralizing antibodies was 
examined. Nine BALBc mice were immunized with BotB protein (purilied as described in 
l-x. 3.M using Ciorbu CiMOP adjuvam (CC Biotech). I hc low pi I elution was mi.sed with 
(ierbu adjuvant and u.sed to immunize mice. Each mou.se received a subcutaneous injection 
ol" 100 Ml imtiycn/adjuvant mi.x (15 pg antigen + 1 ng adjuvant) on day 0. Mice were 
subcutaneouslv boosted as abo\c on day 14 and bled on da> 28. Mice were subsequently 
boosted 1-2 weeks after bleeding and were then bled on day 70. 

Anti-r. hotulinum serotype B toxoid titers were determined in day 28 .scrum from 
individual mice from each group using the F.LJSA protocol outlined in lixample 29 with the 
exception that the plates were coated with C. hniulinum .serotype B toxoid, and the primarv 
antibotK was a chicken anti-C. boiulinum serotype B toxoid. Seroconversion |relativc to 
control mice immunized with pHisBoiH antigen (described below)) was oh.servcd with all 9 
mice immunixed with the purified pHisBotB protein. 

The ability of the anti-BotB antibodies to neutralize native homlimim type B toxin 
was tested in a mouse-C. horiilimiin neutralization model using pooled mouse serum (see I-x. 
23b). I hc LD,„ of purilied C. hniulinum type B toxin complex (Dr. liric .l()hn.son. University 
of Wisconsin. Madison) was determined by a intraperitoneal (IP) method fSchantz and Kautler 
(1978). supni] using 18-22 g female ICR mice. The amount of neutralizing antibodies present 
in the serum of the immunized mice was determined using serum antibody titrations. I hc 
various serum dilutions (0.01 ml) were mixed with 5 LD^, units of C. hoiulinum type B toxin 
and the mixtures were injected IP into mice. I he neutralizations were performed in duplicate. 
The mice were then observed for signs of botulLsm for 4 days. Undiluted serum (day 28 or 
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Clay 70) was found to protect 100% of the injected mice while the l.IO diluted serum did not. 
This corresponds to a neutralization titer of 0.05-0.5 ID/ml. 

These results demonstrate that seroconversion occurred and neutralizing antibodies 
were induced when the plIisBotB protein was utilized as the immunogen. 

EXAMPLE 37 

Construction Of Vectors To Facilitate li.xprcssion 
Of His- Tagged BotB Protein In Fermentation Cultures 

A number of expression vectors were constructed lo lacilitatc the expression of 
recombinant DotB protein in large scale fermentation culture. These constructs varied as to 
the strength of the promoter utilized (T7 or T7lac) and the presence of repressor elements 
(laclq) on the plasmid. The resulting constructs varied in the level of expression achieved and 
in plasmid stability which facilitated the selection of a optimal expression sy.stem for 
fermentation scaleup. 

The BotB expression vectors created for fermentation culture were engineered to 
utilize the kanamycin rather than the ampiciilin resi.stancc gene, and contained either the T7 or 
T7lac promoter. %Nith or without the laclq gene for the reasons outlined in Example 30. 

In all ca.ses. the protein expressed by the various expression vectors is the pHisBot B 
protein described in Example .15. with the only differences between clones being the aheration 
of various regulatory elements. Using the designations outlined below, the pHi.sBotB clone 
(l:x. ?5) is equivaleni to plIisBotB amp T71ac. 

a) Con.struction Of pllisUotB kan r7Iac 

pHisBotB kan T7lac was constructed by insertion of the /?.tr/II////«dIII fragmem of 
Pl IisBotB which contains the BotB gene sequences into the pPA 1 870-2680 kan T7lac vector 
which had been digested with Z?.^'/I1 and HindlU (the pl>A 1 870-2680 kan T7lac vector contains 
ilie pI;T24 kan gene in the pET:."! vector, such that no laclq gene is presem). Proper 
construction of pHisBotB kan T71ac was eonlirmed by restriction digestion. 
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b) Construction Of pHisBotB kan laclq T7lac 

pHisBotB kan laclq T7Iac was constructed by insertion ofihe %/II/f/mdlII fragment 
ofpHisBotB which contains the BolB gene sequences into similarly cut pET24a vector. 
Proper construction of pHisBotB kan laclq Tlhc was confirmed by restriction digestion. 

5 

c) Construction Of pHisBotB kan iaclq T7 

pHisBolB kan laclq T7 was constructed by inserting the Ndel/Xhol Iragmcnl from 
pl-lisBotH kan laclq T7lac which contains the BolB gene sequences into similarly cleaved 
pPA 1 870-2680 kan laclq T7 vector (this vector contains the T7 promoter, the same N- 
10 terminal his-lag as the Boi constructs, the (*. difficile toxin A insert, and the kan laclq genes: 
this cloning replaces the ( '. Jijjficilc loxin A insert with the BotB insert). IVoper construction 
was confirmed by restriction digestion. 

l:\prcssion of recombinant BotB protein from these expression vectors and purification 
of the BotB protein is described in Fxample 38 below. 

15 

EXAMPLE 38 

i'crmeniation And Purification Of Recombinant BolB Protein IJlilizing The 
pI lisBotB kan iaclq T7lac. pHisBoiB kan r7lae And pHisBolB kan laclq 17 Vectors 



20 The pHisBoiB kan laclq T7lac, pHisBotB kan 'r7lac and BoiB kan laclq T7 constructs 

|all transformed into the BI21(Dl-3) .sirain| were grown in fermentation cultures to determine 
the utility of the various constructs lor large scale expression and purification of soluble BotB 
protein. All lermentations were performed as described in Rxample 31. 

25 a) Fermentation Of pHisBotB kan laclq T7iac/BI21(DE3) Cells 

The fermentation culture was induced 45 min post start of glucose feed with I gm 
IPTCi (final concentration = 0.4 mM). pH was maintained at 6.5 rather than 7.0. The ODm„, 
was 27 at time oi' induction, then 35, 38, and 40 at 1-3 hrs po.sl induction. Duplicate platings 
of diluted I hr induction samples (dilutions were prepared us described 31. dilution 3 
30 utilized 3 mI of dilution 2 cells) on fSA and LBi-kan plates yielded 80 TSA colonics and 81 
kan colonies (90% kan resistant). 

Total and soluble protein extracts were resolved on a 12.5% SDS-PAGE gel and total 
protein was detected by staining with Coomassie blue. l,ow level induction of in.soluble Bot 
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B protein was observed, increasing from I lo 3 hrs post induction (no expression was detected 
in uninduced cells). 

b) Fermentation Of pHisBotB kan T7lac/BI21(DE3) Cclb 

The fermentation culture was induced I hr post start ol' glucose leed with 2 gm iPTG 
(final concentration = 0.8 mM). pH was maintained at 6.5 rather than 7.0. The OD«„ was 
24.5 M time of induction, then 31.5. 32. and 33 at 1-3 hrs post induction, respectively. 
Duplicate platings of diluted 0 hr and 2 hr induction samples (dilutions were prepared a.s 
described Ex. 31; dilution 3 utilized 3 mI of dilution 2 cells) on T.SA and LB+kan plates 
yielded 32 TSA colonies and 54 kan colonies (all kan resistant) for uninduced ceils, and I 
TSA colony and 0 kan colonies 2 hr post induction. These results were indicative of strong 
induction, .since viable counts decreased dramatically 2 hrs post induction. 

Total and soluble extracts were resolved on a 10% SD.S-PAGE gel and total protein 
was detected by staining with Coomassic blue. Moderate induction of insoluble BotB protein 
15 was oKscrved. increasing from I to 3 hrs post induction (no expression was detected in 
uninduced cells). 

c) Fermentation Of pHi.sBotB kan laclq T7/BI21(DFJ) Cells 

The fermentation was induced 2 hr post start of gluco.so feed with 4 gm IPTCi (llnal 
?0 concentration = 1.6 niM). pU was maimained at 6.5 rather than 7.0. The OD„„ was 45 ai 
time of induction, then 47. 50. and 50 and 55 ai 1-4 his post induction, respectively. Viable 
colony counts decreased after induction (96. 1. 1. 2. 3; dilution 3 utilized 3 mI of dilution 2 
cells). Of 63 colonies .scored at the lime of induction, all 63 retaining the BoiB pla.smid (kan 
resistant) and no colonies at induction grew on IPTG + Kan plates (no mutations detected). 

^"'^ •'*"'"blc extracts were resolved on a 12.5% SDS-PAGI£ gel and total protein 
was detected by .staining with Coomassie blue. Moderate level induction of insoluble BoiB 
protein uas observed. increa.sing from 1 to 4 hrs post induction (lower level expression was 
detected in uninduced cells, since the T7 rather than T71ac promoter was utilized). 



.30 



d) Purification Of pHisBolB Protein From pHissBotB amp 

T7lac/BI21(DE3) Cells 
Soluble recombinant BotB protein was purified utilizing NiNTA resin from 80 ml of 
cll lysate generated from cells harvested from a pHisBotB fermentation fusing the pHisBotB 
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amp 171ac/BI2l(DE3) strain]. As predicted from ihe small scale results above, the majority 
of the induced protein was insoluble. As well, the cluied material was contaminated with 
multiple £ cnii contaminant proteins. A Coomassie bluc-siained SDS-PAGR gel containing 
extracts derived from pHisBolB amp T7lac/B121(Dr'3) cells before and during purification is 
5 shown in Figure .14. In F'igure 34. lane 1 contains broad range protein MW markers 

(BioRad). Lanes 2-5 contain extracts prepared from pHisBolB amp r7Iac/B121(DI:3) cells 
grown in fermentation culture; lane 2 contains total protein: lane 3 contains soluble protein: 
lane 4 contains protein which did not bind lo the NiNTA column (i.e.. the ilow-through) and 
lane 5 contains protein eluted from the NiNTA column. 

'0 Similar results were obtained using a small scale IDA column utilizing a cell lysate 

from the pllisBoiD kan laclq T7 fermentation described above. 250 mis ofu 20% w/v PFI 
clarified lysaie (50 gms cell pellet) of boiB kan laclq T7/B121(Dl-:3) cells were purilled on a 
small scale IDA column, fhe total yield of eluted proiein was 21 nig protein (assuming 1 
my/ml solution ■■■ 2 ()D.x,/ml). When analyzed by SDS-PACil- and Coomassie staining, the 

15 BotB protein was found to comprise approximately 50% ofihc cluied protein with the 
remainder being a ladder of £. coli proteins similar to that observed with the NiN'IVX 
purillcation. 

The NiN fA alkaline phosphatase conjugate was utilized to detect his-taggcd proteins 
on a Western blot containing total, soluble, soluble (PHI clarified), soluble (after IDA column) 
20 and cliiiion samples from the IDA column purification. The results demonstrated that a small 
percentage of BotB protein was soluble, that the soluble protein was not precipitated by PEl 
treatment and was quantitatively bound by the IDA column. Since a I liter fermentation 
harvest yielded a 67.5 gm cell pellet, this indicated that the yield of soluble affmity purified 
BotB protein from the IDA column was 14 mg/litcr. 

25 

EXAMPLE 39 

(?o-Exprcssion Of Recombinant Botii Proteins 
And F'olding Chapcrones In Fermentation Cultures 

-^^^ Termenlations were performed to determine if the simultaneous overexpression of 

folding chapcrones {i.e.. the Ciro operon) and the BotB protein resulted in enhanced solubility 
of the Bot B protein. This example involved fermentation of the pHisBoiBkan laclq 
r71ac/pACYCGro BL21(DE3). pHisBotB kan T7lac/pA(:YrGro B12I(DE3) and pHisBolBkan 
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laclq 111 pACYCGro BL21(DE3) cell lines. Fermentation was carried out as described in 
ENamplc 31: 34 Mg/ml chloramphenicol was included in the feeder and fermentation cultures. 

a) Fcrmentalion Of pHisBotBkan laclq T7lac/pACYCGro 
BL21(D£3) Cells 

Induction was with 4 gms IPTG at 1 hr 15 min post initiation of the glucose feed. 
The ()D,«, xvas 38 at lime of induction, then 50. 58.5. 62 and 68 at 1-4 hrs post induction. 
Viable colony counts decreased during induction (24. 0. 0. 2. 0 at 0-4 hr induction: dilution 3 
utilized 3 Ml of dilution 2 cells). Of 24 colonies scored at the time of induction. 24 retained 
the BotB plasmid (kan resistant). 24 contained the chaperone plasmid (chloramphenicol 
resistant) and no colonies at induction grew on IPTG-. Kan plates (no mutaiions detected). 

lotal and soluble extracts were resolved on 12.5% SDS-PAGF gels and were either 
stained with (.oomassie blue or subjected to Western blotting (his-tagged proteins were 
dclcetcd utilizing the NiNTA-alkaline phosphatase conjugate). This analy.sis revealed that the 
(iro chaperoncs were induced to high levels, but very low level expression of soluble BotB 
protein w«s observed, increasing from 1 to 4.0 hrs post induction (no expression detected in 
unindueed cells, induced protein delected only on Western blot). The dramatically lower 
expression of BotB protein in the presence of chaperone may be due to promoter occlusion 
(/.t-.. the stronger f? promoter on the chaperone plasmid was preferentially utilized). 

b) Fermentation Of pHlsBotB kan T7lac/pACYCGro/BI21(DE3) 
Cells 

Induction was with 4 gms IPTG at 1 hr post initiation of the glucose feed. The OD^ 
was .-^.V.s at time of induction, then 44. 51. 58.5 and 69 at 1-4 hrs post induction. Viable 
colon) counts decreased after 2 hrs induction (43. 65. 74. 0 (70). 0 (70) at 0-4 hr induction: 
bracketed numbers represent microcolonies: dilution 3 utilized 3 ^1 of dilution 2 cells). Most 
colonies at induction retained the BotB pla.smid (kan resistant)and the chaperone plasmid 
(chloramphenicol resi.stant) and no colonies at induction grew on IPTG+Kan plates (no 
mutations detected). 

l otal and soluble extracts were resolved on a 12.5% SDS-PACiK gci and .subjected to 
Western blotting: his-tagged proteins were detected utilizing the NiNTA-alkaline phosphatase 
conjugate. This analysis revealed that the Gro chaperoncs were induced to high levels and 
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low level expression of soluble Bot B protein was observed, increasing from I lo 4.0 hrs post 
induction (no expression detected in uninduced cells). 

A small scale IDA purification of BotB protein from a 250 ml PHI clarified 15% wA' 
extract (37.5 ym cell pellet) yielded approximately 12.5 mg protein, of which approximately 
50% was BotB protein and 50% was firoEL chapcrone (assessed by Coomassie staining of a 
10% SDS-PAGE gel). The NiNTA alkaline phosphatase conjugate was utilized to detect his- 
lagged proteins on a Western blot containing tolaL soluble, soluble (PHI clarified), soluble 
(after IDA column) and elution samples from the IDA column purification, fhe results 
demonstrated that all of the BotB protein produced by the pMisBotB kan 
T71ac/pACYC(jro/BI2l(DE3) cells was soluble: the BotB protein was not precipitated by PEI 
treatment and was quantitatively bound by the IDA column. Since a 1 liter fermentation 
harvest yielded a 75 gm cell pellet, this indicated that the yield of soluble affiniiy purified bot 
B protein from this I'ermeniation was 12.5 mg/liter. These results also demonstrated that 
additional purification steps are necessary to separate the chaperonc proteins from the BotB 
protein. 

c) Fermentation Of pHisBotBkan laclq 
T7/pACYC(;ro/BL2l(DE3) Cells 

Induction was with 4 gms IPTG at 2 hr post initiation of the glucose feed, flie C)D,^„, 
was 46 at time of induction, then 56. 63, 69 and 71,5 at U4 hrs post induction. Viable colony 
counts decreased after induction (58. 3(5), 3. 0. 0 at (M hr induction: bracketed numbers 
represent microcolonies: dilution 3 utilized 3 o(" dilution 2 cells). All (5.V53) colonies 
scored at the time of induction retained the BotB plasmid (kan resistant) and the chapcrone 
plasmid (chloramphenicol resistant) and no colonies at induction grew on IP TG+kan plates 
(no mutations detected). 

Total and soluble extracts were resolved on a 10% SDS-PA(}E gels and Western 
blotted and his-tagged proteins were detected utilizing the NiN'l A-alkaline phosphatase 
conjugate. This analysis revealed that the Gro chaperones were induced to high levels 
(observed by ponceau S .staining), and a much higher expression of soluble Bot B protein 
(compared lo expression in the pHisBotB kan T7lac/pACYCGro fermentation) was observed 
at all iimep4>ints. including uninduced cells (some increase in BotB protein levels were 
observed after induction). 
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A small scale IDA purification of BolB protein from a 100 ml PEI clarified 15% w/v 
extract (15 gm cell pellet) yielded approximately 40 mg protein, of which approximately 50% 
was 13otB protein and 50% was GroEL chaperone. as assessed by Coomassie staining of a 
10% SDS-PAGE gel. The NiNTA alkaline phosphatase conjugate was utilized to detect his- 
lagged proteins on a Western blot containing total, soluble, soluble (PEI clarified), soluble 
(after IDA column) and elution samples from the IDA column purification. The results 
demonstrated that a significant percentage (i.e., --10-20 %) of BotB protein was soluble, that 
the sokibilized protein was not precipitated by PEI treatment and was quantitatively bound by 
the IDA column. Since a 10 liter fermentation yielded a 108 gm cell pellet, this indicated that 
the yield of soluble affinity purified BotB protein from this fermentation was 144 mg/liier. 

In a scale up experiment. 2 liters of a 20% w/v PEI clarified lysaic of pHisBoiB kan 
lack) r7/pACYCGro/BL2UDE3) cells were purified on a large scale IDA column, llie 
purification was performed in duplicate. The total yield of BotB protein was 220 and 325 
mgs protein in the two experiments (assuming I mg/ml solution ^ 2.0 OD,,,/ml). This 
represents 0.7% or 1,0%. respectively, of the lotal soluble cellular protein (assuming a PEI 
ly.state having a concentration of 8 nig protein/ml and that the eluied material comprises a 1:1 
mixture ol' BoiB and folding chaperone). The NiNTA alkaline phosphatase conjugate was 
utilized to detect his-tagged proteins on a Western blot containing total, soluble, .soluble (PEI 
clarified), soluble (after IDA column) and eluiion samples from the IDA column purification. 
These results demonstrated that a significant percentage (/.c, •10-20 %) of the BotB protein 
was soluble, that the solubilized protein was not precipitated by PEI treaUncnl and was 
i|uaniitati\ely bound by the IDA column. .Since a 1 liter fermentation harvest yielded a 108 
gm cell pellet, this indicated that the yield of soluble affinity purified BotB protein from the 
large scale purification was 60 mg or 89 mg/liter. These results also demonstrated that 
lurther purification would be necessary to remove the contaminating chaperone protein. 

The above results provide methodologies for the purification of .soluble BotB protein 
from fermentation cultures, in a form contaminated predominantly with a single £. cali 
protein (the folding chaperone utilized to enhance solubility). In the next example, methods 
are provided for the removal of the contaminating chaperone protein. 
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EXAMPLE 40 

Removal Of Contaminating Folding Chaperone Protein 
From Puritled Recombinant C. hohilimm Type B Protein 

In this example size exclusion chromatography and ultrafiltration was used to purity 
recombinant BotB protein from the folding chapcrones and imidazole in IDA-purified 
material. 

To enhance the solubility of the recombinant BotB protein during scale-up. the protein 
was co-expres.scd with folding chaperones (see lix. 39). During the Ni-IDA purification step, 
the folding chaperones co-eluted with the BotB protein in 800 niM imidazole: therefore, a 
.second purification step was required to isolate the BotB free of folding chaperones. I.ane 3 
of Figure 35 contains proteins elutcd from an IDA column to which a lysatc of pIlisBotB kan 
laclq •r7/pACYCCiro/BL2l(DC3) cells had been applied: the proteins were resolved on a 
4-15% polyacryiamidc pre-cast gradient gel (Bio-Rad. Hercules. CA) run under native 
conditions and then stained with Coomassic blue. In l igure 35. lanes 1 and 4 contain proteins 
present in peak 1 and peak 2 from a Sephacrx'l .S-100 column run as described hekm: lane 2 
is blank. 

As seen in lane 3 of Figure 35. the IDA-purified sample consists primarily of the 
folding chaperones and the BotB protein, flic fact that the chaperones and the Bot B antigen 
appear as two distinct bands under native conditions suggested they were not complcxcd 
together and therefore, it .should be possible to separate them, using either a gradient of 
imidazole ctincentrations or size exclusion methods. 

In order to determine whether a gradient of imidazole concentrations could be used to 
separate the chaperone from the BotB protein, a step gradient using imidazole at 200. 400. 
600. and 800 niM in 50 mM sodium phosphate. 0.5 M NaCI and 10 % glycerol. pH 6.8 was 
applied to an IDA column (containing proteins bound from a lysatc of pMisBoiB kan laclq 
T7/pAC:YCGro/BL21(DU3) cells). By narrowing the range of imidazole concentrations, it 
was hoped that the BotB and chaperone proteins would differentially cliite at dilTcrent 
concentrations of imidazole. Eluted proteins were monitored by aksorbancc at 280 nm and 
30 collected either manually or with a fraction collector (BioRad). Protein was found to elute at 
200 and 400 mM imidazole only. 

I-igure 36 shows a Coomas.sie stained SDS-PAUIZ gel containing protein eluted during 
the imidazole step gradient. I.ane I contains broad range MW markers (BioRad). Lane 2 
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contams BotB protein purified by IDA chromatography of an extract of pHisBotB/BL2l(DE3) 
pLysS cells grown in shaker nask culture (U:. no co-expression of chaperoncs: Ex. 35). 
Lane 3 contain.s a 20% w/v PEI clarilled lysate of pHi.sBotB kan laclq 
r7/pACYCGro/BL2irDE3) cells (i.e.. the ly.sate prior to purillcation by IDA 
chromatography). I,anes 4 and 5 contain protein which eluted at 200 or 400 mM imidazole, 
rcspectixcly. Lane 6 is blank. Lanes 7 and 8 contain 1/5 the load present in lanes 4 and 5. 

As shown in Figure 36. both the chaperone and the BotB protein eluted in 200 niM 
imidazole, and more chaperone elutes in 400 mM imidazole, however no concentration of 
imidazole tested permitted the elution of BotB protein alone. Consequently, no signilicant 
purillcation was achieved using imidazole at these concemrations. 

Because ol thc considerable difference in molecular weights between the folding 
chaperone. which is a multimer with a total molecular weight around 400 kl) (as determined 
on a Shodcx KB 804 sizing column by HPLC). and the recombinant BotB protein (molecular 
weight around 50 kD). size exclusion chromatography was ne.xt examined for the ability to 
separate these proteins. 



a) Size Exclusion Chromatography 

A column containing Sephacry] .S-IOO HR (S-lOO) (Pharmacia) was poured (2.5 cm x 
24 cm : • 1 10 ml bed volume). The column was equilibrated in a buffer consisting of 
phosphate buffered saline (lOmM potassium phosphate. 150 mM NaCI. pll 7.2) and 10 % 
glycerol (Mallinkrodt). Typically. 5 ml of the IDA-purified BotB protein was filtered through 
u 0.45 M syringe lliter and applied to die column, and the equilibration bulTer was pumped 
through the column at a llow rate of I ml/minute. Eluted proteins were monitored by 
absorbancc at 280 nm and collected either manually or with a fraction collector. Appropriate 
tubes were pooled, if necessary, and the protein was quanlilatcd by absorbance at 280 nm 
and/or h> BCA protein assa> . The isolated peaks were then analyzed by native and/or 
SOS-PAGE to idcntif\ the protein and evaluate the purity. 

Because of its larger size, the folding chaperone eluted first, followed by the 
recombinant BotB protein. A smaller third peak was observed which failed lo stain when 
analyzed by SDS-PAGE and therefore was presumed to be imidazole. 

SDS-PAGE analysis (12.5% polyacrylamide. reduced samples) was u.sed lo evaluate 
the purity of the IDA-purified recombinam BotB protein before and after S-IOO purification. 
The results are shown in Figure 33. 
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In Figure 33, lane I coniains broad range MW markers (BioRad). Lane 5 contains 
IDA-puritled BoiB proiein. Lane 6 contains IDA-puritled BotB protein following S-100 
puriUcalion. Lane 7 is blank (lanes 2-4 were discussed in Fx. 34 above). 

The results shown in Figure 33 show that the IDA-purilled BotB is significantly 
5 contaminated with the folding chaperone (molecular weight about 60 kl) under reducing 

conditions: lane 6). Following S-100 purification, the amount of folding chaperone present in 
the BotB sample was reduced dramatically (lane 7). Visual inspection of the Coomassie 
stained SDS-PAGK gel revealed that after S-100 purification, > W% of the total protein 
present was BotB. 

'^^ The IDA-purified BotB and the S-IOO-purified BotB samples were analyzed by UPLC 

on a size exclusion column (Shodcx KB 804); this analysis revealed that the BotB protein 
represented 64% of the total proiein in the IDA-purified sample and that following S-IOO 
purification, the BotB protein represented >95% of the total protein in the sample. 

The IDA-purified BotB material was also applied to a ACA 44 (SpeciraPor. Houston. 

1,^ fX) column. The ACA 44 resin is equivalent to the S-IOO resin and chromatography using 
the A(*A 44 resin was carried out exactly as described above for the S-100 resin. The ACA 
44 resin was found to separate the recombinant BotB protein from the folding chaperone. 
The ACA 44-purified BotB sample was analyzed for endotoxin using the LAL assay 
(Associates of Cape Cod) as describe in Hxample 24. I wo aliqouis of the ACA 44-purificd 

20 lit)tB preparation were analyzed and were found to contain either 5H to 1 16 1-li/mg 
recombinant protein or 94 to 1 89 EU/mg recombinant protein. 

These results demonstrate that size exclusion chromatography can be used to purify the 
recombinant lUilB proiein from the folding chaperone and imidazole in ID.A-purified material. 

-3 b) Ulfranitration For The Separation Of Recombinant BotB 

Protein And Chapcroncs 

Ultrafiltration was examined as an alternative method for the separation recombinant 
BotB proiein and folding chaperones in IDA-purified material. While in this example only 
mixtures of BotB and chaperones were separated by ultrafiltration, this technique is suitable 
30 for use with recombinant BotA and BotH proteins as well provided that the wash buffers used 
are altered as necessary to take into account different requirements for solubiliix-. 

fhe recombinant BotB protein and folding chaperones were separated using a two-step 
sequential ultrafiltration method. The first membrane used had a nominal molecular weight 
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cutoff (MWCO) of approximately 100 IcD; this membrane retains the larger folding chaperonc 
while allowing the smaller recombinant protein to pass through. The addition of several 
volumes of wash buffer may be required to emcicntly wash the recombinant protein through 
the membrane. The second step utilized a membrane with a nominal MWCO of 
approximately 10 kD. During this step, the recombinant antigen was retained by the 
membrane and could be concemratcd to the degree desired and the imidazole and excess wash 
buffer passed through the membrane. 

Twenty-seven milliliters of an IDA-puritlcd BotB preparation was ultrallltercd through 
a 47 mm YM 100 (100 kD MWCO) membrane (Amicon) in a 50 ml stirred cell (Amicon). 
The membrane wa.s washed in dd 11,0 prior to use as recommended by the manufacturer. Six 
volumes of 10% glycerol in PBS were washed through to remove most of the rccombinam 
iiotli protein and this wash was collected in a separate vessel. The resulting BotB 
protein-rich llltrate was then concentrated l2.fold using a YM 10 (10 kD MWCO) membrane 
I.Amicon). to a llnal v<,lume of 14 mi. The YM 100 and YM 10 concemrates were analyzed 
along with the lysate starting material by native PAGE using a 4 -15% pre-cast gradient gel 
(BioRad). I he results arc shown in Figure 37. 

In r-igurc .^7. lane 1 contains IDA-purified BotB derived from a shaker llask culture 
(I.e.. n„ co-c.xpression of chaperones: lix. 35): lane 2 contain.s a 20% wA' PRl clarified lysate 
of pHisBotB kan laclq T7/pACYCGro/BL2UDE3) cells: lane .3 shows the lysate of lane 3 
alter IDA purification: lane 4 contains the YM 10 concentrate and lane 5 contains the YM 
1 00 concentrate. 

The results shown in Figure 37 demonstrate thai the recombinant BotB protein can be 
pun lied away iVom the lolding chapcrone by ultrallltration through a 100 kD MWCO 
membrane (lane 4). leaving the chaperonc protein in the 100 kD concentrate (lane 5). 
Analysis of the sample in lane 5 also showed that very little of the BotB protein was retained 
by the 100 kl) MWCO membrane after 6 volumes of wash buffer had been applied. 

The BotB samples following IDA chromatography and following ultrafiltration through 
the YM 100 membrane were aniyzed by HPLC on a size exclusion column (Shodex KB 804): 
this analysis revealed that the BotB protein represented 64% of the total protein in the IDA- 
purified sample and that following ultrafiltration through the YM 100 membrane, the BotB 
protein represented >96% of the total protein in the sample. 

The BotB protein purified by ultrafiltration through the YM 100 membrane was 
examined for endotoxin using the LAL assay (Associates of Cape Cod) as describe in 
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Example 24, Two aliqouts of the YM lOO-purificd BotB preparation were analyzed and were 
found to contain either 18 to 36 EU/mg recombinant protein or 125 to 250 EU/mg 
recombinant protein. 

The above results demonstrate that size exclusion chromatography and ulirafiliraiion 
can be used to purify recombinant botulinal toxin proteins away from folding chapcrones. 

EXAMPLE 41 

Cloning And Expression Of The C Fragment 
Of The C. hondinum Serotype R Toxin Gene 

riic C. hotulinum type li neurotoxin gene has been cloned and sequenced from several 
diflercni strains [Poulei et al. (I992) Biochem. Biophys. Res. Commun. 183:107 (strain 
Beluga); Whelan c/ al. (IW2) liur. J. Biochem. 204:657 (strain NCTC 11219): Tujii ct al. 
(1990) Microbiol. Immunol, 34:1041 (partial sequence of strains Mashike. Iwani and Otaru) 
and Fujii a al. (1993) J. Gen. Microbiol. 139:79 (strain Mashike)). The nucleotide sequence 
of the type I: toxin gene is available from the EMBL sequence data bank under accession 
numbers X62089 (strain Beluga) and X62683 (strain NCTC 1 1219). fhc nucleotide sequence 
of the coding region (strain Beluga) is listed in SKQ ID NO:49. The amino acid sequence of 
the C. hotulinum type I: neurotoxin derived from strain Belgua is listed in SL-Q ID N():50. 
fhc nucleotide sequence of the coding region (strain NCTC 1 1219) is listed in SEQ ID 
NO:5l. The amino acid sequence of the C. hofulimtm type F: neurotoxin derived from strain 
NCTC 11219 is listed in StQ ID N():52. 

fhc DNA sequence encoding the native ( hnndinum serotype K C iVagment gene 
derived from the Beluga strain can be expressed as a histidine-tagged protein using the 
pl-Tllisb vector: the resulting coding region is listed in SEQ ID N():53 and the corresponding 
amino acid sequence is listed in SEQ ID NO:54. I'he DNA sequence encoding the C 
fragment of the native C. hoiidinum serotype C gene derived from the NCTC 1 1219 strain 
can be expressed as a histidine-tagged fusion protein using the pE'ITIisb vector: the resulting 
coding region is listed in SEQ ID NO:55 and the corresponding amino acid sequence is listed 
in SEQ ID NO:56. fhe C fragment region from any strain of handmum serotype E can be 
amplified and expressed using the approach illustrated below using the C fragment derived 
from C. hoiidinum type E 223lslrain (ATCC #17786). 
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The type E neurotoxin gene is synthesized as a single polypeptide chain which may be 
converted to a double-chain form (/.c. a heavy chain and a light chain) by cleavage with 
trypsin; unlike the type A neurotoxin, the type E neurotoxin exists essentially only in the 
single-chain fomi. The 50 kD carboxy-terminal portion of the heavy chain is referred to as 
the C fragment or the H,. domain. H.xpression of the C Iragment of C. hoiiilimim type f- toxin 
in heterologous hosts {t'.^'.. E. coii) has not been previously reported. 

The native C fragment of the C. hoiulinum .serotype E toxin (BotE) gene was cloned 
and inserted into expression vectors to facilitate expression of the recombinant BotE protein in 
L. vnli. This example involved PCR amplification of the gene, cloning, and construction of 
expression vectors. 

The BotE serotype gene was isolated using PCR as described for the BotA serotype 
gene in Example 28. The C. hoiulinum type E strain was obtained from the American Type 
Culture Collection (ATCC #17786: strain 2231). The following primer pair was used in the 
I'CR amplification: .V-CGCCATGGCTCTTTCTTCTTAT ACAC.ATGAT-.r (.V primer, 
engineered \a>\ site underlined) (SEQ ID NO:57) and 

.^'-(^C•AMiC^7T.•^TTTT•^•CTTGCC.•ATC•CATG-3• {V primer, engineered ////,dlll site 
underlined, native gene termination codon italicized) (SEQ IIJ N():58). The PCR product was 
inserted into pCRscript as described in Example 28. The resulting pCR.scripi BotE clone was 
conlirmcd by restriction digestion, as well as. by obtaining the sequence of approximately 300 
bases located at the 5' end of the C fragment coding region using standard DNA sequencing 
methods. The resulting BotE sequence was identical to tliat of the published C. hoiulinum 
type M lo.xin sequence fWhelan ci at (1992). xupra\. 

The A/jt'!(nilcd)////>7dm fragment from a pCRscript BotE recombinant was cloned into 
pETHisb vector as described for BotA C fragment in Example 28. The resulting construct 
was termed pHisBotE. pHisBotE expresses the BotE gene under the control of the T7 lac 
promoter and the resulting protein contains an N-terminal IOXF4is-tag affinity tag. 

The pHisBotE expression construct was transformed into BL2I(DE3) pLy.s.S competent 
cells and I liter cultures were grown, induced and his-tagged proteins were purified utiliicing a 
NiNl A icsin (eluted in low pH elution buffer) as described in Example 28. Total, .soluble 
and purified proteins were resolved by SDS-PAGE and detected by Coomassie staining. The 
results are shown in Figure 38. 

In Figure 38. lane I contains broad range MW markers (BioRad): lane 2 contains a 
total protein extract: lane 3 contains a soluble protein extract: lane 4 contains proteins present 
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in the How through from the NiNTA column (this sample was not diluted prior to loading and 
therefore represents a load 5X that of the load applied for the total and soluble extracts in 
lanes 2 and 3); lane 5 contains proteins eluted from the NIN TA column: lane 6 contains 
protein eluted from a NiNTA column which had been stored at -20°C for 1 year. 

The pllisBolE protein was expressed at moderate levels (7 mg/Iiler) as a totally soluble 
protein. I'he purified protein migrated as a single band ol the predicted MW. 

Western blot hybridization utilizing a chicken anti-C. hotulhmm serotype \\ toxoid 
primary antibody (generated by immunization of hens as described in Example 3 using c: 
hotulhwm serotype E toxoid ) was also performed on the lolal. soluble and purilied BoiE 
proteins. Samples of BotA and BotB C fragments were also included on the gels to facilitate 
MW and immunogeniciiy comparisons. Strong immunoreactivity was detected using the 
anti-C. hoiulimim type E toxoid antibody only with the HoiE protein. 

These results demonstrate that the native BolE gene sequences can be expressed as a 
soluble his-iagged protein in E. coU and purified by metal-chelation affmiiy chromatography. 

EXAMPLE 42 

(ieneration Of Neutralizing Antibodies Using fhe Recombinant plIisBoiE Protein 

The ability of the purified pHisBotE protein to generate neutralizing antibodies was 
examined. Nine BAEBc mice were immunized with BotE protein (purified as described in 
Ex. 41) using Cierbu GMDP adjuvant (CC Biotech). The low pH elution was mixed with 
Cierbu adjuvant and used to immunize mice. Each mouse received a subcutaneous injection 
of 100 nl antigen/adjuvant mix (35 |.ig antigen + I j.ig adjuvant) on day 0. Mice were 
subcutaneously boosted as above on day 14 and bled on day 28. Mice were subsequently 
boosted and bled on day 70. 

Anti-C hotulinum serotype E toxoid titers were determined in day 28 serum from 
individual mice from each group using the ELISA protocol outlined in Example 29 with the 
exception that the plates were coated with C. hotulinum serotype E toxoid, and the primary 
antibody was a chicken anti-C. hotulinum serotype E toxoid. Seroconversion (relative to 
control mice immunized with the pOxlIisBotA antigen (l-x. 29)] was observed with all 9 mice 
immunized with the purified pHisBotI: protein. 

The ability of the anti-BoiE antibodies to neutralize native (\ hotulinum type E toxin 
was tested in a mouse-C. Iwtulinum neutralization model using pooled mouse serum (see Ex. 
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23b). The LD,„ of purified C hoiulinum type E toxin complex (Dr. Eric Johnson. University 
of Wisconsin. Madison) was determined by a intraperitoneal (IP) method (Schantz and Kautler 
(1978). supra! using 18-22 g female ICR mice. The amount of neutralizing antibodies prcsem 
in the serum of the immunized mice vvas determined using serum antibody titrations. The 
various serum dilutions (0.01 ml) were mixed with 5 LD,. units of C. hondimm type F toxin 
and the mixtures were injected IP imo mice. The neutralizations were performed in duplicate. 
The mice were then observed for signs of botulism for 4 days. LJndilutcd serum from day 28 
did not protect, while undiluted. l/IO diluted and 1/100 diluted day 70 serum protected (1005 
of animals) while 1/1000 diluted day 70 scrum did not. This corresponds to u neutralization 
liter of 50-500 lU/ml. 

These results demon.strate that seroconversion occurred and neutralizing antibodies 
were induced when the recombinant BotK protein was utilized as the immunogcn. 

EXAMPLE 43 

Construction Of Vectors l o Facilitate Expression 
Of His-Tagged BotE Protein In Fermentation Cultures 

A number of expression vectors were constructed lo facilitate the expression of 
recombinant BotI- protein in large scale fermentation culture. These constructs varied as to 
the stronuih of the promoter utilized (T7 or T71ac) and the presence of repressor elements 
flaclci) on the plasmid. The resulting constructs varied in the level of expression achieved and 
in plasmid stability ujiich facilitated the .selection of a optimal expression system for 
Icrmcntation scaleup. This example involved a) construction of BotE expression vectors and 
b) determination of expression levels in small scale cultures. 

SI) Construction Of BotE Expression Vectors 

The BotE expression \ ectors created for fermentation culture were engineered lo 
ulilize the kanamycin rather than ihc ampicillin resistance gene, and contained either the T7 or 
T7lac piomoler. with or without the laclq gene for the reasons outlined in Example 30. 

In all cases, the protein expressed by the various expression vectors is the pHisBotE 
protein described in E.xample 41. with the only differences between clones being the alteration 
of various regulatory elements. Using the designations outlined below, the plIisBotli clone 
(Ex. 41) is equivalent to pHisBotH amp T7lac. 
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i) Construction Of pHisBotE kan hiclq T7lac 

pHisBotE kan laclq T71ac was conslructcd by inserting the Xhal/HindlU fragment of 
pHisBotE which contains the BolE gene sequences into A'A^/I/Z/Z/rdlll-cleaved pnT24a vector. 
Proper construction was confirmed by restriction digestion. 

ii) Construction Of plIisBotE kan T7 

pIlisBotE kan '17 was constructed by ligating the Botli-coniaining XhaUSapl fragment 
of pHisBotl: kan laclqT7lac to the T7 promoter-containing XhaUSapl fragment of pr:T23a. 
Proper construction was confirmed by restriction digestion. 

ill) Construction Of pHisBotlC kan lacIqT7 

pHisBotE kan laclqT7 was constructed by inserting ihc Z?.i:/ll//y//7dlll Iragmenl from 
pHisBoiE kan 17 which contains the BotH gene sequences into /^^,'/II////ndIII-cleaved pET24 
vector. Proper construction was confirmed by restriction digestion. 

h) Determination Of BotE Expression Levels In Small Scale 
Cultures 

i hc three BotE kan expression vectors described above were transformed into 
B121(DE3) competent cells and 50 ml (2XYT + 40 |.ig/ml kan) cultures were grown and 
induced with ITPG as described in Example 28. Total and soluble protein extracts from 
before and after induction made as described in Example 28. The total and soluble extracts 
were resolved on a 12.5% SDS-PAGE geL and his-tagged proteins were detected on a 
Western blot utilizing the NiNTA-alkaline phosphatase conjugate as described in Example 
31(c)(iii). The results showed that all three BotE cell lines expressed his-tagged proteins of 
the predicted MW for the BotE protein upon induction. The results also demonstrated that 
the two constructs that contained the T7 promoter expressed the BotE. protein before 
induction, while the T7lac promoter construct did not. Upon induction, the T7 promoter- 
containing constructs induced to higher levels than the T7lac-containing construct, with the 
pnisBoiE kan Iaclq r7/BI2I(DE3) cells accumulating the maximal levels of BotE protein. 
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EXAMPLE 44 

Expression And Purification Of pHisBotE From Fermentation Cultures 

Based on the small scale inductions performed in Example 43. the pHisBotE kan laclq 
T7/Bi21(DF.3) strain was selected for fermentation scaleup. This example involved the 
lermcntation and purification of recombinam BotE C fragment protein. 

A fermemation with the pHisBotE kan laclq T7/B12I(DF3) strain was performed as 
described in Example 31. Tlie fermentation culture was induced 2 hrs post start of the 
gluco.se feed with 4 gm IPTG (final concemration = 1.6 mM). The 0D„„, was 42 at time of 
induction, then 46.5. 48. 53 and 54 at 1-4 hrs post induction. Viable colony counts decreased 
from 0-4 hr induction 1131. 4 (28). 7 (3), 7. 8; dilution 3 utilized 6 mI of dilution 2 cells: 
bracketed colonies are microcolonies]. All (32/32) colonics .scored at the time of induction 
retained the BotE plasmid (kan resistant) and no colonies at induction grew on IPTO+Kan 
plates (no mutations detected). These results were indicative of strong promoter induction, 
since colony viability reduced after induction, and the culture stopped growing during 
fermentation (stopped at 54 OD„„/ml). 

Total and soluble extracts were resolved on a 12.5% SDS-PAGH gel and total protein 
was detected by staining with Coomassie blue, fhe results are shown in Figure 39. 

In Figure 39. lane 1 contains total protein from a pHisBotA kan T7 lac/B12I(DE3) 
pl.y.s.S fermentation (Ex. 24). Lane.s 2-9 comain extracts prepared from the above pHisBotE 
kan laclq T7/B121(DE3) fermentation; lanes 2- 4 contain total protein extracts prepared at 0. 1 
and 2 h()ur.s po.st-induction. respectively. Lane 5 contains a soluble protein extract prepared at 
2 hours p*^si-induction. Lanes 6 and 7 contain total and soluble extracts prepared at 3 hours 
post-induction, respectively, Lanes 8 and 9 contain total and soluble extracts prepared at 4 
hours po.st-induction. respectively. Lane 10 contains broad range MW markers (BioRad). 

The results shown in Figure 39 demonstrate that moderate level induction of totally 
soluble Bot H protein was observed, increasing from I to 4 hrs post induction (no expression 
was detected in uninduced cells). From a 2 liter fermentation harvest a 155 gm (wet wt) cell 
pellet was obtained and used to make a PEI-clarified lysate (I liter in CRB. pH 6.8). The 
ly.satc was applied to a large scale IDA column and 200 mg of BotE protein, which was 
found to be greater than 95% pure (as Judged by visual inspection of a Coonmsie stained 
SDS-PAGE gel), was recovered. This represcms 2.5% of the total soluble cellular protein 



- 207 - 



wo 98/08540 



PCTAJS97/15394 



(assuming a PEI lysatc having a concenlraiion of 8 mg proiein/ml) and corresponds to a yield 
of 100 mg BotE proiein/liter of fermeniaiion culture. 

The above rcsuUs demonstrate that high levels of the recombinant BoiE protein can be 
expressed and purified from fermentation cultures. 

EXAMPLE 45 

Removal Of Imidazole From Purified Recombinant BotE Protein Preparations 

The expression of recombinant BotE protein, unlike the UoiA and [ioiB proteins, did 
not require the presence of folding cliaperones to maintain solubiliiy during scale-up. A size 
exclusion chromatography step was included however to remove the imidazole from the 
sample and exchange the IDA elution buffer for one consistent with the BoiA antigen. 

A Sephacryl S-lOO MR (S-lOO: Pharmacia) column was poured (2.5 cm n 24 cm: bed 
volume - 1 10 ml). Under these conditions, the BotH protein should be retained by the beads 
to a lesser degree than the smaller imidazole, therefore the BotE protein should eluie from the 
column before the imidazole. The column was equilibrated in a buffer consisting of 50 niM 
sodium phosphate, 0.5 M NaCI. and 10% glycerol (all reagents irom Mallinkrodt), Mve 
milliliters of the IDA-puritled BotE protein (Ex. 44) was filtered through a 0.45 /i syringe 
filler and applied to the S-lOO column, and equilibration butfer was pumped through the 
column at a flow rate of 1 ml/minute. Eluied proteins were monitored by absorbance at 280 
nm. and collected either manually or with a fraction collector. Appropriate tubes were pooled 
if necessary, and the protein was quantiiated b\ absorbance at 2X0 nm and/or 1K\\ protein 
assay. The isolated peaks were then analyzed by native and/or SDS-PAUl: to identify the 
protein(s) and evaluate the purity. 

Figure 40 provides a representative chromalogram showing the purification ol* 
IDA-purified BotE on the S-lOO column. Even though folding chaperones were not 
over-expressed with this antigen, a small amount of protein eluted at a time consistent with 
the folding chaperones expressed with BotA and BotB proteins (Gro) (see the first peak). The 
.second peak in the chromatogram contained the Botli protein, and the third peak was 
presumably imidazole. This presumed imidazole peak was i.solaled in comparable levels in 
IDA-purified BotA and BotB protein preparations as well. 
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These results demonstrate that size exclusion chromatography can be used to remove 
imidazole and traces of contaminating high molecular weight proteins from IDA-purified 
BotF protein preparations. 

The S-lOO-puriHed BmE protein was tested for endotoxin contamination using the 
l-AI. assay as described in Example 24. This preparation was found to contain 64 to 128 
HlJ/mg recombinant protein and is therefore substantially free of endotoxin. 

The S-lOO purified BotE was mixed with purified preparations of BotA and BotB 
proteins and used to immunize mice: 5 Mg of each Bot protein was used per immunization 
and alum was included as an adjuvant. After two immunizations with this trivalcnt vaccine, 
the immunized mice were challangcd with C. honilinum toxin. The immunized mice 
contained neutralizing antibodies sufficient to neutralize between 100.000 to 1.000.000 I.D,„ 
of either toxin A or toxin B and between 1.000 to 1 0.000 I.D„. of toxin H. The tiler of 
neutralizing antibodies directed against toxin E would be expected to increase following 
suKsequent boosts with the vaccine. These resuhs demonstrate that a trivaleni vaccine 
containing recombinant BotA. BotB and BotE proteins provokes neutralizing antibodies. 

EXAMPLE 46 

Expression Of The C Fragment Of The ( hmulintim 
Serotype C Toxin Gene And Generation Of Neutralizing Antibodies 

The c: hoinlhwm type CI neurotoxin gene has been cloned and .sequenced [Kimura t-/ 
al. (IWO) IJiochcm. Biophys, Res. Comm. 171:1304]. The nucleotide .sequence of the toxin 
gene deriN cd from the C. boiulinum type C .strain C-Stockholm is available from the 
l-Mni./Genliank sequence data banks under the accession number Dy02IO: the nucleotide 
•sequence of the coding region is listed in SEQ ID NO:59. The amino acid sequence of the C. 
hoiulmum type CI neurotoxin derived from this strain is listed in .SEQ II) NO:60. 

The DNA sequence encoding the native C. hottilinum serotype CI C fragment gene 
derived from the C-Stoekholm strain can be expressed using the pETHisb vector: the resulting 
coding region is listed in SEQ ID NO:61 and the corresponding amino acid sequence is listed 
in SEO If) NO:62. The C fragment region from any strain of C. hniulhmm .serotype C can be 
amplilled and expressed using the approach illustrated below using the C fragment derived 
Irom C. hntulinum type C C-Stockholm strain. Expression of the C fragment of C hn,ulim,m 
type CI toxin in heterologous hosts {ca.. E. coli) has not been previously reported. 
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The C fragment of the C holulinum serotype CI (BolCl) toxin gene Ls cloned using 
the protocols and conditions described in Example 28 for the isolation of the native BotA 
gene. A number of C hotulimm serotype C strains (expressing either or both CI and C2 
toxin) arc available from the ATCC [e.^.. 2220 (ATCC 17782). 2239 (ATCC 17783), 2223 
(A TCC 17784: a type C-(} strain; C-P strains produce C2 toxin), 662 (ATCC 17849: a type 
C-ix strain: C-a strains produce mainly CI toxin and a small amount of C2 toxin). 2021 
(ATCC 17850: a type C-(x strain) and VPI 3803 (ATCC 25766)|. Allernatively. other type C 
strains may be employed for the isolation of sequences encoding the C fragment of C\ 
hotulimim serotype C toxin. 

The following primer pair is used to amplify the BotC gene: .5'>CG CCATGG C 
TTTA rTAAAAGATATAArTAATG-3' [5' primer, engineered .Vtv;l site underlined (SRQ ID 
NO:63)l and D^GCAMKXE'/T/i'lTCACTTACAGGTAC AAAACC-3* |3* primer, 
engineered //mdlll site underlined, native gene termination codon italicized (SEQ ID 
NO:64)|. I'ollowinLi PCR amplification, the PCR product is inserted into the pCRscript vector 
and then the 1.5 kb fragment is cloned into pl-THi.sb vector as described for BotA C fragment 
gene in i;\ample 28. The resulting construct is termed pHisBotC. Proper construclion is 
confirmed by DNA sequencing of the BotC sequences contained within pllisBolC. 

pHisBotC expresses the BotC gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-terminal lOXI lis-tag affinity tag. I he 
pHisBotC expression construct is transformed into BI.21(DE3) pLysS competent cells and 1 
liter cultures are grown, induced and his-lagged proteins are purified utilizing a NiNTA resin 
(eluled in 250 niM imidazole. 20% glycerol) as described in F.xample 28. l otal. soluble and 
purified proteins are resolved b> SDS-PAGH and detected by Coomassie .staining and Western 
blot hybridization utilizing a Ni-N TA-alkaline phosphatase conjugate (Qiagen) which 
recognizes his-tagged proteins as described in Example 3l(c)(iii). Fhis analy.sis permits the 
determination of expression levels of the pIlisBotC protein (/.c., number of mg/liter expressed 
as a soluble protein). The purified BotC protein will migrate as a siniile band ol* the predicted 
MW (/.c. -oOkD). 

The level of expression of the pHisBotC protein may be modified (increased) by 
substitution of the 17 promoter for the T7lae promoter, or by inclusion of the laclq gene on 
the expression plasmid. and plasmid expressed in BL2 1(003) cell lines in fermentation 
cultures as described in Example 30. If only very low levels (/.l'.. less than 0.5%) of soluble 
pHisBotC protein are expressed using the above expression sy.stems. the pHisBotC construct 
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may be co-expressed with pACYCGro construct as described in Example 32. In this case, the 
recombinant BotC protein may co-purify with the folding chaperoncs. The contaminating 
chaperones may be removed as described in Example 34. Preparations of purified pHisBoiC 
protein are tested for endotoxin contamination using the I-AL assay as described in Fixample 
24. 

The purifed pHisBoiC protein is used to generate neutralizing antibodies, BALBc 
mice are immunized with the BotC protein using Gerbu GMDP adjuvant (CC Biotech) as 
described in Example 36. The ability of the anti-BoiC antibodies to neutralize native C. 
hntulimim type C toxin is demonstrated using the mouse-C". hon4Unum neutralization model 
described in Example 36. 



EXAMPLE 47 

Expression Of The C Fragment Of The C hntulimun 
Serotype D Toxin Gene And Generation Of Neutralizing Antibodies 

The i\ hotulhwm type D neurotoxin gene has been cloned and sequenced fSunagawa 
ei ai. (1992) j. Vet, Med. Sci. 54:905 and Binz el cil. (1990) Nucleic Acids Res. 18:5556]. 
The nucleotide sequence of the toxin gene derived from the CBI6 strain is available from the 
RMBL/GenBank sequence data banks under the accession number S49407: the nucleotide 
sequence of the coding region is listed in SEQ ID NO:65. The amino acid sequence of the C. 
hotidhnim type D neurotoxin derived from the CB16 strain is listed in SEQ ID NO:66. 

1 he DNA sequence encoding the native C\ hotulhwm serotype D C Iragment gene 
derived Irom a BotD expressing strain can be expressed using the pETlIisb vector: the 
resulting eoding region is listed in SEQ ID NO:67 and the corresponding amino acid sequence 
IS listed in SEQ ID NO:68. The C fragment region iVom any strain of (.'. hoiulhmm serotype 
D can be amplified and expressed using the approach illusiraled below using the C fragment 
derived from ( hoiulmum type D CBI6 strain. Expression of the C fragment ol' C. 
hoiulhium type D toxin in heterologous hosts (c'.^r.. E. co/i) has not been previously reported. 

The C fragment of the C \ homlmum serotype D (BotD) toxin gene is cloned using the 
protocols and conditions described in Example 28 for the isolation of the native BotA gene. 
A number of C\ hofulmum type D strains are available from the ATCC ATCC 9633. 
2023 (ATCC 17851). and VPl 5995 (ATCC 27517)]. 



- 211 - 



wo 98/08540 



PCT/US97/15394 



The following primer pair is used to amplify the BotD gene: S'-CG CCATGG C 
TTTATTAAAAGATATAATTAATG-3' [5" primer, engineered Ncol site underlined (SIZQ ID 
NO:63)) and 5^GCMG£n7T/lCTCTACCCATCCTGGATCCCT-3' \y primer, engineered 
//mdlll site underlined, native gene termination c(xlon italicized (SEQ ID N():69)l. 
Following PCR amplification, the PGR product is inserted into the pCRscript vector and then 
the 1.5 kb fragment is cloned into pETHisb vector as described for BotA C fragment gene in 
Kxample 28. The resulting construct is termed pHisBotD. 

pHisBotD expresses the BotD gene sequences under Ihe transcriptional control ol' the 
T7 lac promoter and the resulting protein contains an N-terminal lOXIIis-tag affinity lag. The 
pHisBotD expression construct is transformed into BL21{DE3) pLysS competent cells and I 
liter cultures are grown, induced and his-tagged proteins arc purified utilizing a NiNTA resin 
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE 
and detected by Coomassic .staining and Western blot hybridisation utilizing a Ni-NTA- 
alkaline phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in 
Example 3l(c)(iii). This analysis permits the determination of expression levels of the 
pHisBotD protein (i.e.. number of mg/liier expres.sed as a soluble protein). The purified BotD 
protein will migrate as a single band of the predicted MW (i.e.. -50kD). 

fhc level of expression of the pHisBotD protein may be modified (increased) by 
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the iaclq gene on 
the expression plasmid. and plasmid expressed in Bl.21(DE3) cell lines in fermentation 
cultures as described in E.xample 30. If only very low levels (i.e.. less than about 0.5%) of 
soluble pHisBoiD protein are expressed using the above expression systems, the pHisBoiD 
construct may be co-expressed with pACYCGro construct as described in Example 32. In 
this case, the recombinant BotD protein may co-purify with the folding chaperoncs. The 
contaminating chaperones may be removed as described in Example 34. Preparations of 
purified pHisBotD protein are tested for endotoxin contamination using the LAL assay as 
described in Example 24. 

The purifed pHisBotD protein is ujjcd to generate neutralizing antibodies. BALBc 
mice are immunized with the BotD protein u.sing Gerbu GMDP adjuvant (CC Biotech) as 
described in Example 36. The ability of the anti-BotD antibodies to neutralize native C. 
hotullnum type D toxin is demonstrated using the mouse-C. hutuUnvm neutralization model 
described in lixample 36. 
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EXAMPLE 48 

Expression Of The C Fragment Of The C. hoiulinum 
Serotype F Toxin Gene And Generation Of Neutralizing Antibodies 

The c: hoiulimim type F neurotoxin gene has been cloned and sequenced [ East ei al. 
(mi) KEMS Microbiol. Lett. 96:225]. The nucleotide sequence of the toxin ycne derived 
from the 202F strain (ATCC 23387) is available from the EMBL/GcnBank sequence data 
banks under the accession number M92906: the nucleotide sequence of the coding region is 
listed in SEO ID N():70. The amino acid sequence of the C". hoiulinum type F neurotoxin 
derived from the 202F strain is listed in SEQ ID N0:7I. 

The DNa sequence encoding the native C. hoiulinum serotype I- C IVagment gene 
derived from the 202F strain can be expressed using the pETHisb vector: the resulting coding 
region is listed in SEQ ID NO:72 and the corresponding amino acid .sequence is listed in SEQ 
ID N():73. The C fragment region from any strain of C. hoiulinum serotype F can be 
amplified and expres.scd using the approach illustrated below using the C fragment derived 
from C. hoiulinum type F 202I- strain. Expression of the C fragment of C. hoiulinum type F 
toxin in heterologous liosis (u.fi.. E. coli) has not been pre\ iously reported. 

Tlie C IVagment oC the C. hoiulinum serotype F (BotI-) toxin gene is cloned using the 
protocols and conditions described in Example 28 for the isolation of the native BotA gene. 
The r hoiulinum type I- 202F strain is obtained from the American Type Culture C ollection 
(A ICC 2.1387). Alternatively, sequences encoding the BotF toxin may be isolated from any 
lioti- expressing strain [c.^j.. VPI 4404 (ATCC 25764). VPI 2382 (AlCC 27321 ) and 
Langeiand (ATCC 354 1 5) |. 

I he following primer pair is used to amplify the BotF gene: 5'-CG£CATGGC 
TA ITCTAAT rATArATTTTAATAG-3- [5" primer, engineered Nco] site underlined (.SEQ 
ID N():74)) and 5^GCMyCTr7-C'W-rTCTTTCCATCCATTCTC-3- [3" primer, engineered 
IlindWl .site underlined, native gene termination codon italicized (SEQ ID NO:75)]. 
Following PCR amplilication. the PCR product is inserted into the pCRseript vector and then 
the 1.5 kb fragment is cloned into pETHi.sb vector as described for BotA C fragment gene in 
Example 28. The resulting construct is termed pHisBotF. 

pMlsBotF expresses the BotF gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-temiinal lOXHis-tag alfmity tag. The 
pHisBotF expression construct is traasformcd into BL2l(DE3) pLy.sS competem cells and I 
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liter cultures arc grown, induced and his-iagged proteins arc purified utilizing a NiNTA resin 
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE 
and detected by Coomassic staining and Western blot hybridization utilizing a Ni-NTA- 
alkalinc phosphatase conjugate (Qiagen) which recognizes his-taggcd proteins as described in 
lixample 3 l(c)(iii). This analysis permits the determination of expression levels of the 
pHisBolF protein {ix\. number of mg/lilcr expressed as a soluble protein). The purified BotF 
protein will migrate as a single band of the predicted MW (^c^. - 5()kD). 

The level of expression of the pHisBotF protein may be modified (increased) by 
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the laclq gene on 
the expression plasmid. and plasmid expressed in BL21(DE3) cell lines in fermentation 
cultures as described in Example 30. If only very low levels (/.c. less than about 0.5%) of 
soluble plIisBoiF protein are expressed using the above expression systems, the pIIisBotF 
construct may be co-expressed with pACYCOro construct as described in Example 32. In 
this case, the recombinant BotF protein may co-purify with the folding chapcrones. The 
contaminating chaperones may be removed as described in Example 34. Preparations of 
purified pHisBotF protein are tested for endotoxin contamination using the LAL assay as 
described in Example 24. 

The purifed pIIisBotF protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the BotI' protein using Gcrbu GMDP adjuvant (CC Biotech) as 
described in Example 30. The ability of the anti-BolF antibodies to neutralize native ( \ 
hoiulinum type 1-^ toxin is demonstrated using the mouse-('. hotu/hmm neutralization model 
described in Example 36. 



EXAMPLE 49 

Expression Of The C Fragment Of The ( \ bondmum 
Serotype G Toxin Cicne And Generation Of Neutralizing Antibodies 

The hoiitUnum type G neurotoxin gene has been cloned and sequenced | Campbell et 
uL (1993) Biochimica el Biophysica Acta 1216:487 and Binz tv ciL (1990) Nucleic Acids Res. 
18:5.5.561. The nucleotide sequence of the toxin gene derived from the 1 13/30 strain (NCFB 
3012) is available from the EMBL/GenBank .sequence data banks under the accession number 
X74162: the nucleotide sequence of the coding region is listed in SEQ ID NO:76. The amino 
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acid sequence of tlie C. hotulinum type G neurotoxin derived from this strain is listed in SEQ 
ID NO:77. 

The DNA sequence encodinji the native C. hotulimm serotype G C: fragment gene 
derived from the 1 13/30 strain can be expressed using the pETHisb vector; the resulting 
coding region is listed in SEQ ID NO:78 and the corresponding amino acid sequence is listed 
in SEQ ID NO: 79. The C fragment region from any strain of C. hotulimnn serotype G can 
be amplified and expressed using the approach illustrated below using the C iragmem derived 
from r. b,uulim,m type G 113/30 strain. Expression of the C fragment off. boiulinum type 
G toxin in heterologous hosts {e.^.. E. coli) has not been previously reported. 

The C fragment of the C. hotulimm serotype G (BotG) toxin gene is cloned using the 
protocols and conditions described in Example 28 for the isolation of the native BolA gene. 
The C. batuUmm type G 1 13/30 strain is obtained from the NCFB. The following primer 
pair is used to amplify the BolCJ gene: 5 -CG CCAI GG CTGA(' ACAATTTTAA TACA 
AG 1-3" f.'^" primer, engineered Nco\ .site underlined (SEQ ID NO:80)J and 
5 -GCCIIXJA(irr.rrTCTGTCCA rCCTTCATCCAC-3- I3- primer, engineered Xhni site 
underlined, native gene termination codon italicized (SEQ ID N0:8I)|. following PCR 
amplification, the PCR product is in.serted into the pCRscript vector and then the 1.5 kb 
fragment is cloned into pETHisb vector as described for BoiA C fragment gene in Example 
28 with the exception that the sequences encoding BotG are excised from the pCRscript 
\ector by digestion with Nco\ and A7i«l and the Nco\ site is blunted (the BotG sequences 
contain an internal ///mllll site). This ,Va>l(fillcdVA7»«l fragment is then ligatcd to the 
pETMisb vector which has been digested with Nhit\ and .S«/l and the A7u'l site is blunted. 
The resulting con.struct is termed plIisBotG. 

pHisBotG expresses the BotG gene sequences under the transcriptional control of the 
r? lac promoter and the resulting protein contains an N-terminal lOXHis-tag affinity tag. The 
pHisBotG expression construct is transformed into BL2UDE3) pLysS competent cells and 1 
liter cultures are grown, induced and his-tagged proteins are purified utilizing a NiNTA resin 
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE 
and detected by Coomassie staining and Western blot hybridization utilizing a Ni-NTA- 
alkaline phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in 
lixample 3l{cXiii). This analysis permits the detennination of expression levels of the 
pHisBotG protein (/.t-.. number of mg/liier expressed as a soluble protein). The purified BotG 
protein will migrate as a single band of the predicted MW (i.e.. -SOkD). 
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The level of expression of ihe pHisBolG protein may be modified (increased) by 
substitution of the T7 promoter for the T71ac promoter, or by inclusion of the laclq gene on 
the expression plasmid. and plasmid expressed in BL2UDE3) cell lines in fermentation 
cultures as described in Example 30. if only very low levels (i.e. less than about 0.5%) of 
soluble pHisBotG protein are expressed using the above expression systems, the pHisBolCi 
construct may be co-expressed with pACYCGro construct as described in l-xample 32. In 
this case, the recombinant BolG protein may co-purify with the folding chapcrones. The 
contaminating chaperones may be removed as described in Example 34. Preparations of 
purilied pHisBoiG protein are tested for endotoxin contamination using the LAL assay as 
described in Example 24. 

The purifed pHisBotG protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the BoiG protein using Gerbu GMDP adjuvant (C*C Biotech) as 
described in Example 36. The ability of the anti-BotG antibodies to neutralize native C \ 
hoiu/inum type Ci toxin is demonstrated using the mouse-c; homlinum neutralization model 
described in Example 36. 



EXAMPLE SO 

Expression Of Recombinant Botulinal Toxin Proteins In Eucaryoiic Most Cells 

Recombinant botulinal C fragment proteins may Ix; expressed in eucaryoiic host cells, 
such as yeast and insect cells. 



a) Expression In Yeast 

Botulinal C fragments derived from serotypes A. B. C. D, E. F and G may be 
25 expressed in yeast cells using a variety of commercially available vectors. I'tir example, the 
pPIC3K and pPIC9K expression vectors (Invitrogen) may be employed for expression in the 
methylotrophic yeast, Pichia paxtons. When the pPIC3K vector is employed, expression of 
the botulinal C fragment protein will be intracellular. When the pPIC3K vector is employed, 
the botulinal C fragment protein will be secreted (the alpha factor .secretion signal is provided 
30 on the pPIC9K vector). 

DNA sequences encoding the desired C fragment is inserted into these vectors using 
techniques known to the art. Briefly, the desired botulinal expression cassette (including 
.sequences encoding the his-iag: described in the preceding examples) is amplified using the 



- 216 - 



wo 98/08540 



PCT/US97/15394 



PGR in conjunction with primers that incorporate unique restriction sites at the termini of the 
amplified fragment. Suitable restriction enzyme sites include SnaBL EcoRl AvrU and No(\, 
When the botulinal C fragment is to be expressed using the pPIC3K vector, the initiator 
methionine (ATG) is provided by the desired Bot gene sequence and a Kozak consensus 
sequence is engineered upstream of the ATG (c jif., ACC ATG G). 

The amplified restriction fragment containing the botulinal C fragment gene is then 
cloned into the desired expression vector Recombinant clones are integrated into the Pichia 
paxtoris genome and recombinant protein expression is induced using methanol following the 
manufacturer's insiruclions (Inviirogen Pichia expression kit manual). 

(\ hotuUnum genes are A/T rich and contain multiple sequences that arc similar to 
yeasi transcriptional termination signals TTTTTATA). If premature transcription 
termination is observed v^hen the botulinal C fragment genes are expressed in ycasi. the 
transcription termination signals present in the C fragment genes can be removed by cither 
site directed mutagenesis (utilizing the pALTER system: Promega) or by construction of 
synthetic genes utilizing overlapping synthetic primers. 

The botulinal C fragment genes may be expressed in other yeast cells using other 
commercially available vectors [e.^^,, using the pYES2 vector (Invitrogen) and S. cercvisiae 
cells (Invitrogen)]. 

b) Expression In Insect Cells 

Botulinal C fragments derived from serotypes A. B. C. D. E. F and G may be 
expressed in insect cells using a variety of commercially available vectors. For example, the 
pBlueBac4 transfer vector (Inviirogen) may be employed for expression in Spodoptera 
Jnif^ipurda {Sf)) insect cells (baculovirus expression system) (equivalent baculovirus vectors 
and host cells are avaialble from other vendors, e.}^., Pharmingen. San Diego, CA). Botulinal 
C fragments contained on Ncol/Hindlll fragments contained within the pHisBotA-G 
expression constructs (described in the preceding examples) are cloned into the pBlueBac4 
vector (digested with Nco] and HindlU): the Ncol site present on the C fragment constructs 
overlaps with the start codon of the fusion proteins. In the case of botulinal C fragment 
clones that contain internal HindlU sites {c.f^,. using the BotG sequences described in Ex. 49). 
the C fragment gene is contained within a Ncol/Xhol fragment on the pMisBot construct. 
This NcoMXhol fragment is excised from pHisBot and inserted into pBlueBac4 digested with 
AxYii and .SWI. Recombinant baculoviruses are made and the desired recombinant C fragment 
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is expressed in S/9 cells using the protocols provided by the manufacturer (Invitrogen MaxBac 
manual). The resulting constructs will express the pHisBot protein intracelluiarly (including 
the N-tcrminal his-tag) under the control of the polyhedrin promoter. For extracellular 
secretion of botulinal C fragment proteins, the C fragment sequences from the pHisBot 
constructs arc cloned into the pMelBacB vector (Invitrogen) as described above for the 
pBlueBac4 vector. When the pMelBacB vector is employed, the his-tagged botulinal C 
fragment proteins are secreted (utilizing a vector-encoded honeybee melittin secretion signal) 
and contain a nine amino acid extension at the N-terminus. 

His-tagged botulinal C fragments expressed in yeast or insect cells are purified using 
metal chelation columns as described in the preceding examples. 

From the above it is clear that the presem invention provides compositions and 
methods for the preparation of effective multivalem vaccines against C. haiulmum neurotoxin, 
it is also contemplated that the recombinant botulinal proteins be u.scd for the prtxluction of 
antitoxins. All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Williams, James A. 

Thalley, Bruce s. 

(ii) TITLE OF INVENTION: Multivalent Vaccine For Clostridium 
Botulinum Neurotoxin ^ioacriaium 

(iii) NUMBER OF SEQUENCES: 82 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Medlen ' & Carroll 

STSf^"^' Montgomery Street, Suite 2200 

tc» CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: United States of America 

(F) ZIP: 94104 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentin Release #l.o. Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 'us 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Carroll, Peter G. 

(B) REGISTRATION NUMBER: 32,837 

(C) REFERENCE/DOCKET NUMBER*: OPHD-02 95g 

(ix) TELECOMMUNICATION INFORMATION- 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: (415) 397-8338 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: 
GGAAATTTAG CTGCAGCATC TGAC 
(2) INFORMATION FOR SEQ ID NO : 2 ; 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
TCTAGCAAAT TCGCTTGTGT TGAA 
(2) INFORMATION FOR SEQ ID NO : 3 : 



10 
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ii) SEQUENCE CHARACTERISTICS; 
<A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCGCATATA GCATTAGACC 

(2) INFORMATION FOR SEQ ID NO: 4: 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
<xi) SEQUENCE DESCRIPTION: SEQ ID N0:4 : 
25 CTATCTAGGC CTAAAGTAT 

(2) INFORMATION FOR SEQ ID NO: 5: 



M) 



40 



45 



50 



60 



65 



70 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



-O (ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . .8130 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



20 



19 



ATG TCT TTA ATA TCT AAA GAA GAG TTA ATA AAA CTC GCA TAT AGC ATT 48 

Met Ser Leu lie Ser Lys Glu Glu Leu lie Lys Leu Ala Tyr Ser He 

i 5 10 15 

AGA CCA AGA GAA AAT GAG TAT AAA ACT ATA CTA ACT AAT TTA GAC GAA 96 

Arg Pro Arg Glu Asn Glu Tyr Lys Thr He Leu Thr Asn Leu Asp Glu 

20 25 30 

TAT AAT AAG TTA ACT ACA AAC AAT AAT GAA AAT AAA TAT TTG CAA TTA 14 4 

Tyr Asn Lys Leu Thr Thr Asn Asn Asn Glu Asn Lys Tyr Leu Gin Leu 

35 40 45 

55 AAA AAA CTA AAT GAA TCA ATT GAT GTT TTT ATG AAT AAA TAT AAA ACT 192 

Lys Lys Leu Asn Glu Ser He Asp Val Phe Met Asn Lys Tyr Lys Thr 

50 55 60 



TCA AGC AGA AAT AGA GCA CTC TCT AAT CTA AAA AAA GAT ATA TTA AAA 24 0 

Ser Ser Arg Asn Arg Ala Leu Ser Asn Leu Lys Lys Asp He Leu Lys 

65 70 75 ' 80 

GAA GTA ATT CTT ATT AAA AAT TCC AAT ACA AGC CCT GTA GAA AAA AAT 2 88 

Glu Val lie Leu He Lys Asn Ser Asn Thr Ser Pro Val Glu Lys Asn 

85 90 95 

TTA CAT TTT GTA TGG ATA GGT GGA GAA GTC AGT GAT ATT GCT CTT GAA 3 36 

Leu His Phe Val Trp He Gly Gly Glu Val Ser Asp He Ala Leu Glu 

100 105 110 
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TAC ATA AAA CAA TGG GCT GAT ATT AAT GCA GAA TAT AAT ATT AAA CTG 384 
Tyr He Lys Gin Trp Ala Asp He Asn Ala Glu Tyr Asn He Lys Leu 
115 120 125 

> TGG TAT GAT AGT GAA GCA TTC TTA GTA AAT ACA CTA AAA AAG GCT ATA 432 

Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala He 
130 135 

U) w'^T '^^'^ "^^^ 'T'^A CTA CTA GAG GAA GAG ATT 480 

I" Val Glu ser Ser Thr Thr Glu Ala Leu Gin Leu Leu Glu Glu Glu He 

150 155 160 

CAA AAT CCT CAA TTT GAT AAT ATG AAA TTT TAC AAA AAA AGG ATG GAA 
Gin Asn Pro Gin Phe Asp Asn Met Lys Phe Tyr Lys Lys Arg Met Glu 
1€5 170 175 

TTT ATA TAT GAT AGA CAA AAA AGG TTT ATA AAT TAT TAT AAA TCT CAA 
Phe lie Tyr Asp Arg Gin Lys Arg Phe He Asn Tyr Tyr Lys Ser Gin 
180 185 190 

ATC AAT AAA CCT ACA GTA CCT ACA ATA GAT GAT ATT ATA AAG TCT CAT 
He Asn Lys Pro Thr Val Pro Thr He Asp Asp He He Lys Ser His 
195 200 205 

CTA GTA TCT GAA TAT AAT AGA GAT GAA ACT GTA TTA GAA TCA TAT AGA 
uGu Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tyr Arg 
^10 215 220 



20 



40 



55 



60 



65 



AAG TAT AAA AAA TAT ATA AAT AAT TAT ACA TCA GAA AAC TTT GAT AAA 
Lys Tyr Lys Lys Tyr He Asn Asn Tyr Thr Ser Glu Asn Phe Asp Lys 

325 330 335 

CTT GAT CAA CAA TTA AAA GAT AAT TTT AAA CTC ATT ATA GAA AGT AAA 
Leu Asp Gin Gin Leu Lys Asp Asn Phe Lys Leu lie He Glu Ser Lys 
340 345 350 

AGT GAA AAA TCT GAG ATA TTT TCT AAA TTA GAA AAT TTA AAT GTA TCT 
Ser Glu Lys Ser Glu He Phe Ser Lys Leu Glu Asn Leu Asn val Ser 
355 360 365 

GAT CTT GAA ATT AAA ATA GCT TTC GCT TTA GGC AGT GTT ATA AAT CAA 
Asp Leu Glu He Lys He Ala Phe Ala Leu Gly Ser Val He Asn Gin 
370 375 380 



528 



576 



624 



672 



-W) ^'^^ ^ ^™ AAT CAT GGG ATA GAT ATC AGG 720 

Thr Asn Ser Leu Arg Lys He Asn Ser Asn His Gly He Asp He Arg 

230 235 240 



768 



GCT AAT AGT TTG TTT ACA GAA CAA GAG TTA TTA AAT ATT TAT AGT CAG 
Ala Asn Ser Leu Phe Thr Glu Gin Glu Leu Leu Asn He Tyr Ser Gin 
245 250 255 

GAG TTG TTA AAT CGT GGA AAT TTA GCT GCA GCA TCT GAC ATA GTA AGA 816 
Glu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp He Val Arg 
260 265 270 

TTA TTA GCC CTA AAA AAT TTT GGC GGA GTA TAT TTA GAT GTT GAT ATG 864 
Leu Leu Ala Leu Lys Asn Phe Gly Gly Val Tyr Leu Asp Val Asp Mec 
275 280 285 

4> CTT CCA GGT ATT CAC TCT GAT TTA TTT AAA ACA ATA TCT AGA CCT AGC 912 

Leu Pro Gly He His Ser Asp Leu Phe Lys Thr He Ser Arg Pro Ser 
290 295 300 

TCT ATT GGA CTA GAC CGT TGG GAA ATG ATA AAA TTA GAG GCT ATT ATG 960 
Ser He Gly Leu Asp Arg Trp Glu Met He Lys Leu Glu Ala He Met 

310 315 320 



1008 



1056 



1104 



1152 
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1296 



1344 



1392 



1440 



1488 



1536 



GCC TTG ATA TCA AAA CAA GGT TCA TAT CTT ACT AAC CTA GTA ATA GAA 1200 
Ala Leu He Ser Lys Gin Gly Ser Tyr Leu Thr Asn Leu Val He Glu 

390 395 400 

CAA GTA AAA AAT AGA TAT CAA TTT TTA AAC CAA CAC CTT AAC CCA GCC 1248 
Gin Val Lys Asn Arg Tyr Gin Phe Leu Asn Gin His Leu Asn Pro Ala 
405 410 415 

ATA GAG TCT GAT AAT AAC TTC ACA GAT ACT ACT AAA ATT TTT CAT GAT 
lie Glu Ser Asp Asn Asn Phe Thr Asp Thr Thr Lys He Phe His Asp 
420 425 430 

TCA TTA TTT AAT TCA GCT ACC GCA GAA AAC TCT ATG TTT TTA ACA AAA 
Ser Leu Phe Asn Ser Ala Thr Ala Glu Asn Ser Met Phe Leu Thr Lys 
435 440 445 

ATA GCA CCA TAC TTA CAA GTA GGT TTT ATG CCA GAA GCT CGC TCC ACA 
He Ala Pro Tyr Leu Gin Val Gly Phe Met Pro Glu Ala Arg Ser Thr 
450 455 460 

ATA AGT TTA AGT GGT CCA GGA GCT TAT GCG TCA GCT TAC TAT GAT TTC 
He Ser Leu Ser Gly Pro Gly Ala Tyr Ala Ser Ala Tyr Tyr Asp Phe 
470 475 480 

ATA AAT TTA CAA GAA AAT ACT ATA GAA AAA ACT TTA AAA GCA TCA GAT 
He Asn Leu Gin Glu Asn Thr He Glu Lys Thr Leu Lys Ala Ser Asp 
485 490 495 

TTA ATA GAA TTT AAA TTC CCA GAA AAT AAT CTA TCT CAA TTG ACA GAA 
Leu He Glu Phe Lys Phe Pro Glu Asn Asn Leu Ser Gin Leu Thr Glu 
500 505 510 

CAA GAA ATA AAT AGT CTA TGG AGC TTT GAT CAA GCA AGT GCA AAA TAT 1584 
Gin Glu He Asn Ser Leu Trp Ser Phe Asp Gin Ala Ser Ala Lys Tyr 
515 520 525 

CAA TTT GAG AAA TAT GTA AGA GAT TAT ACT GGT GGA TCT CTT TCT GAA 163 2 

Gin Phe Glu Lys Tyr Val Arg Asp Tyr Thr Glv Gly Ser Leu Ser Glu 
530 535 ' 540 

GAC AAT GGG GTA GAC TTT AAT AAA AAT ACT GCC CTC GAC AAA AAC TAT 1680 
Asp Acn Gly Val Asp Phe Asn Lys Asn Thr Ala Leu Asp Lys Asn Tvr 
545 550 555 560 

TTA TTA AAT AAT AAA ATT CCA TCA AAC AAT GTA GAA GAA GCT GGA AGT 1728 
Leu Leu Asn Asn Lys He Pro Ser Asn Asn Val Glu Glu Ala Gly Ser 
565 570 575 

AAA AAT TAT GTT CAT TAT ATC ATA CAG TTA CAA GGA GAT GAT ATA AGT 1776 
Lys Asn Tyr Val His Tyr He He Gin Leu Gin Gly Asp Asp He Ser 
580 585 590 

TAT GAA GCA ACA TGC AAT TTA TTT TCT AAA AAT CCT AAA AAT AGT ATT 1824 
Tyr Glu Ala Thr Cys Asn Leu Phe Ser Lys Asn Pro Lys Asn Ser He 
595 600 605 

ATT ATA CAA CGA AAT ATG AAT GAA AGT GCA AAA AGC TAC TTT TTA AGT 1872 
He He Gin Arg Asn Met Asn Glu Ser Ala Lys Ser Tvr Phe Leu Ser 
610 615 620 ' 

GAT GAT GGA GAA TCT ATT TTA GAA TTA AAT AAA TAT AGG ATA CCT GAA 1920 
Asp Asp Gly Glu Ser He Leu Glu Leu Asn Lys Tyr Arg He Pro Glu 

625 630 635 640 

AGA TTA AAA AAT AAG GAA AAA GTA AAA GTA ACC TTT ATT GGA CAT GGT 1968 
Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr Phe He Gly His Gly 
645 650 655 
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^ ASD 11^ '^'^ AGT GTA GAT TCA 

Lys Asp Glu Phe Asn Thr Ser Glu Phe Ala Arg Leu Ser Val Asp Ser 

665 670 



Ser m u ''^'^ ^'^'^ '^^'^ ^ ^TA GAT ATA 

Leu Ser Asn Glu He Ser Ser Phe Leu Asp Thr He Lys Leu Asp He 

680 685 

Ifr PrI ^ °™ '^^ ^ <5GA TGT AAT ATG TTT AGT 

ser Pro Lys Asn Val Glu Val Asn Leu Leu Gly Cys Asn Met Phe Ser 

695 700 

TAT GAT TTI AAT GTT GAA GAA ACT TAT CCT GGG AAG TTG CTA TTA AGT 
Tyr ASP Phe Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu sir 

710 715 720 

ATT ATG GAC AAA ATT ACT TCC ACT TTA CCT GAT GTA AAT AAA AAT TCT 
lie Met Asp Lys lie Thr Ser Thr Leu Pro AsJ vll A^I ^ ^n III 
725 730 

III ThK XT'" ^f* "^^^ °™ AGA ATT AAT AGT GAG GGA 

He Thr He Gly Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly 
^'*0 745 750 

a™ f^"^ "^"^ ^ TGG ATA AAT AAA GAA GAA 

Arg Lys Glu Leu Leu Ala His Ser Gly Lys Trp He Asn Lys Glu Glu 
'^^ 760 765 



GCT ATT ATG AGC GAT TTA TCT AGT AAA GAA TAC ATT TTT TTT GAT TCT 
Ala He Met Ser Asp Leu Ser Ser Lys Glu Tyr He Phe Phe Asp III 
" 775 780 

He A»o ^"^ ^ T"^^ AAT ATT CCA GGA TTA GCA 

He Asp Asn Lys Leu Lys Ala Lys Ser Lys Asn He Pro Gly Leu Ala 

r11 l^^ ^^"^ ^l'^ ^ T'^A CTT GAT GCA AGT GTT AGT 

Ser He Ser Glu Asp He Lys Thr Leu Leu Leu Asp Ala Ser Val Ser 
805 aio ai5 

CCT GAT ACA AAA TTT ATT TTA AAT AAT CTT AAG CTT AAT ATT GAA TCT 
Pro Asp Thr Lys Phe He Leu Asn Asn Leu Lys Leu Asn He Glu Ser 
820 825 830 

TCT ATT GGG GAT TAC ATT TAT TAT GAA AAA TTA GAG CCT GTT AAA AAT 
ser He Gly Asp Tyr He Tyr Tyr Glu Lys Leu Glu Pro Vai ^ JJJ 
835 340 845 

ATA ATT CAC AAT TCT ATA GAT GAT TTA ATA GAT GAG TTC AAT CTA CTT 

lie ^^"^ '^^P '^^P Phe Asn Leu Leu 

855 860 

GAA AAT GTA TCT GAT GAA TTA TAT GAA TTA AAA AAA TTA AAT AAT CTA 
Glu Asn val Ser Asp Glu Leu Tyr Glu Leu Lys ^ i^l S 

GAT GAG AAG TAT TTA ATA TCT TTT GAA GAT ATC TCA AAA AAT AAT TCA 
ASP Glu Lys Tyr Leu He Ser Phe Glu Asp He Ser Lys Asn Asn ler 
885 890 895 

ACT TAC TCT GTA AGA TTT ATT AAC AAA AGT AAT GGT GAG TCA GTT TAT 
Thr Tyr Ser Val Arg Phe He Asn Lys Ser Asn Gly Glu Ser Val iyl 
900 905 

Sf* *^ A" '"'T TCA AAA TAT AGC GAA CAT ATT ACA 

val Glu Thr Glu Lys Glu He Phe Ser Lys Tyr Ser Glu His lie ThJ 

920 925 



2016 



2064 



2112 



2160 



2208 



2256 



n04 



2352 



2400 



2446 



2496 



2544 



2592 



1:640 



2688 



2736 



2784 
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AAA GAA ATA AGT ACT ATA AAG AAT AGT ATA ATT ACA GAT GTT AAT GGT 2832 
Lys Glu He Ser Thr He Lys Asn Ser He He Thr Asp Val Asn Gly 
930 935 940 

AAT TTA TTG GAT AAT ATA CAG TTA GAT CAT ACT TCT CAA GTT AAT ACA 2880 
Asn Leu Leu Asp Asn He Gin Leu Asp His Thr Ser Gin Val Asn Thr 
945 950 955 960 

TTA AAC GCA GCA TTC TTT ATT CAA TCA TTA ATA GAT TAT AGT AGC AAT 2928 
Leu Asn Ala Ala Phe Phe He Gin Ser Leu He Asp Tyr Ser Ser Asn 
965 970 975 

AAA GAT GTA CTG AAT GAT TTA AGT ACC TCA GTT AAG GTT CAA CTT TAT 2976 
Lys Asp Val Leu Asn Asp Leu Ser Thr Ser Val Lys Val Gin Leu Tyr 
980 985 990 

GCT CAA CTA TTT AGT ACA GGT TTA AAT ACT ATA TAT GAC TCT ATC CAA 3024 
Ala Gin Leu Phe Ser Thr Gly Leu Asn Thr He Tyr Asp Ser He Gin 
995 1000 1005 

TTA GTA AAT TTA ATA TCA AAT GCA GTA AAT GAT ACT ATA AAT GTA CTA 3072 
Leu Val Asn Leu He Ser Asn Ala Val Asn Asp Thr He Asn Val Leu 
1010 1015 1020 

CCT ACA ATA ACA GAG GGG ATA CCT ATT GTA TCT ACT ATA TTA GAC GGA 3120 
Pro Thr He Thr Glu Gly He Pro He Val Ser Thr He Leu Asp Gly 
1025 1030 1035 1040 

ATA AAC TTA GGT GCA GCA ATT AAG GAA TTA CTA GAC GAA CAT GAC CCA 3168 
He Asn Leu Gly Ala Ala He Lys Glu Leu Leu Asp Glu His Asp Pro 
1045 1050 1055 

TTA CTA AAA AAA GAA TTA GAA GCT AAG GTG GGT GTT TTA GCA ATA AAT 3216 
Leu Leu Lys Lys Glu Leu Glu Ala Lys Val Gly Val Leu Ala He Asn 
1060 1065 1070 

ATG TCA TTA TCT ATA GCT GCA ACT GTA GCT TCA ATT GTT GGA ATA GGT 3264 
Met Ser Leu Ser He Ala Ala Thr Val Ala Ser He Val Gly He Gly 
1075 1080 1085 

GCT GAA GTT ACT ATT TTC TTA TTA CCT ATA GCT GGT ATA TCT GCA GGA 3 312 

Ala Glu Val Thr He Phe Leu Leu Pro He Ala Gly He Ser Ala Gly 
1090 1095 1100 

ATA CCT TCA TTA GTT AAT AAT GAA TTA ATA TTG CAT GAT AAG GCA ACT 3 36 0 

He Pro Ser Leu Val Asn Asn Glu Leu He Leu His Asp Lys Ala Thr 
1105 1110 1115 1120 

TCA GTG GTA AAC TAT TTT AAT CAT TTG TCT GAA TCT AAA AAA TAT GGC 3 408 

Ser Val Val Asn Tyr Phe Asn His Leu Ser Glu Ser Lys Lys Tyr Gly 
1125 1130 1135 

CCT CTT AAA ACA GAA GAT GAT AAA ATT TTA GTT CCT ATT GAT GAT TTA 34 56 

Pro Leu Lys Thr Glu Asp Asp Lys He Leu Val Pro He Asp Asp Leu 
1140 1145 1150 

GTA ATA TCA GAA ATA GAT TTT AAT AAT AAT TCG ATA AAA CTA GGA ACA 3 504 

Val He Ser Glu He Asp Phe Asn Asn Asn Ser He Lys Leu Gly Thr 
1155 1160 1165 

TGT AAT ATA TTA GCA ATG GAG GGG GGA TCA GGA CAC ACA GTG ACT GGT 3 552 

Cys Asn He Leu Ala Met Glu Gly Gly Ser Gly His Thr Val Thr Gly 
1170 1175 1180 

AAT ATA GAT CAC TTT TTC TCA TCT CCA TCT ATA AGT TCT CAT ATT CCT 3600 
Asn He Asp His Phe Phe Ser Ser Pro Ser He Ser Ser His He Pro 
1185 1190 1195 1200 



wo 98/08540 



PCTAJS97/15394 



TCA TTA TCA ATT TAT TCT GCA ATA GGT ATA GAA ACA GAA AAT CTA GAT 
Ser Leu Ser lie Tyr Ser Ala He Gly He Glu Thr Giu Asn Leu Asp 
1205 1210 1215 

TTT TCA AAA AAA ATA ATG ATG TTA CCT AAT GCT CCT TCA AGA GTG TTT 
Phe Ser Lys Lys He Met Met Leu Pro Asn Ala Pro Ser Arg Val Phe 
1220 1225 1230 

TGG TGG GAA ACT GGA GCA GTT CCA GGT TTA AGA TCA TTG GAA AAT GAC 
Trp Trp Glu Thr Gly Ala Val Pro Gly Leu Arg Ser Leu Glu Asn Asp 
1235 1240 1245 

GGA ACT AGA TTA CTT GAT TCA ATA AGA GAT TTA TAC CCA GGT AAA TTT 
Oly Thr Arg Leu Leu Asp Ser He Arg Asp Leu Tyr Pro Gly Lys Phe 
1250 1255 1260 

TAC TGG AGA TTC TAT GCT TTT TTC GAT TAT GCA ATA ACT ACA TTA AAA 
Tyr Trp Arg Phe Tyr Ala Phe Phe Asp Tyr Ala He Thr Thr Leu Lys 
^^^^ 1270 1275 1280 

CCA GTT TAT GAA GAC ACT AAT ATT AAA ATT AAA CTA GAT AAA GAT ACT 
Pro val Tyr Glu Asp Thr Asn He Lys He Lys Leu Asp Lys Asp Thr 
1285 1290 1295 

AGA AAC TTC ATA ATG CCA ACT ATA ACT ACT AAC GAA ATT AGA AAC AAA 
Arg Asn Phe He Met Pro Thr He Thr Thr Asn Glu He Arg Asn Lvs 
1300 1305 i3io 

TTA TCT TAT TCA TTT GAT GGA GCA GGA GGA ACT TAC TCT TTA TTA TTA 
Leu Ser Tyr Ser Phe Asp Gly Ala Gly Gly Thr Tyr Ser Leu Leu Leu 
1315 1320 1325 

TCT TCA TAT CCA ATA TCA ACG AAT ATA AAT TTA TCT AAA GAT GAT TTA 
Ser ser Tyr Pro He Ser Thr Asn He Asn Leu Ser Lys Asp Asp Leu 
1330 1335 j^34Q 

TGG ATA TTT AAT ATT GAT AAT GAA GTA AGA GAA ATA TCT ATA GAA AAT 
Trp He Phe Asn He Asp Asn Glu Val Arg Glu He Ser He Glu Asn 
^^^^ 1350 1355 1360 

GGT ACT ATT AAA AAA GGA AAG TTA ATA AAA GAT GTT TTA AGT AAA ATT 
Gly Thr He Lys Lys Gly Lys Leu He Lys Asp Val Leu Ser Lys He 
1365 1370 3^375 

GAT ATA AAT AAA AAT AAA CTT ATT ATA GGC AAT CAA ACA ATA GAT TTT 
Asp He Asn Lys Asn Lys Leu He He Gly Asn Gin Thr He Asp Phe 
1380 1385 

TCA GGC GAT ATA GAT AAT AAA GAT AGA TAT ATA TTC TTG ACT TGT GAG 
Ser Gly Asp He Asp Asn Lys Asp Arg Tyr He Phe Leu Thr Cys Glu 
1395 1400 1405 

TTA GAT GAT AAA ATT AGT TTA ATA ATA GAA ATA AAT CTT GTT GCA AAA 
^'ff.^^P ^^"^ He He Glu He Asn Leu Val Ala Lys 

1410 1415 1420 

TCT TAT AGT TTG TTA TTG TCT GGG GAT AAA AAT TAT TTG ATA TCC AAT 
Ser Tyr Ser Leu Leu Leu Ser Gly Asp Lys Asn Tyr Leu He Ser Asn 
^'*25 1430 1435 ^^^^ 

TTA TCT AAT ACT ATT GAG AAA ATC AAT ACT TTA GGC CTA GAT AGT AAA 
Leu Ser Asn Thr He Glu Lys He Asn Thr Leu Gly Leu Asp Ser Lys 
1445 1450 1455 



3648 



3696 



3744 



3792 



3840 



3888 



3936 



3984 



4032 



4080 



4128 



4176 



4224 



427: 



4320 



4368 
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AAT ATA GCG TAC AAT TAG ACT GAT GAA TCT AAT AAT AAA TAT TTT GGA 4416 
Asn He Ala Tyr Asn Tyr Thr Asp Glu Ser Asn Asn Lys Tyr Phe Gly 
1460 1465 1470 

5 GCT ATA TCT AAA ACA AGT CAA AAA AGC ATA ATA CAT TAT AAA AAA GAC 4464 

Ala He Ser Lys Thr Ser Gin Lys Ser He He His Tyr Lys Lys Asp 
1475 1480 1485 

AGT AAA AAT ATA TTA GAA TTT TAT AAT GAC AGT ACA TTA GAA TTT AAC 4512 
H) Ser Lys Asn He Leu Glu Phe Tyr Asn Asp Ser Thr Leu Glu Phe Asn 
1490 1495 1500 

AGT AAA GAT TTT ATT GCT GAA GAT ATA AAT GTA TTT ATG AAA GAT GAT 4 560 

Ser Lys Asp Phe He Ala Glu Asp He Aon Val Phe Mec Lys Asp Asp 
1> 1505 1510 1515 1520 



20 



40 



50 



ATT AAT ACT ATA ACA GGA AAA TAC TAT GTT GAT AAT AAT ACT GAT AAA 4608 
He Asn Thr He Thr Gly Lys Tyr Tyr Val Asp Asn Asn Thr Asp Lys 
1525 1530 1535 

AGT ATA GAT TTC TCT ATT TCT TTA GTT AGT AAA AAT CAA GTA AAA GTA 4 656 

Ser He Asp Phe Ser He Ser Leu Val Ser Lys Asn Gin Val Lys Val 
1540 1545 1550 



AAT GGA TTA TAT TTA AAT GAA TCC GTA TAC TCA TCT TAC CTT GAT TTT 4 704 

Asn Gly Leu Tyr Leu Asn Glu Ser Val Tyr Ser Ser Tyr Leu Asp Phe 
1555 1560 1565 

GTG AAA AAT TCA GAT GGA CAC CAT AAT ACT TCT AAT TTT ATG AAT TTA 4752 

.n* Val Lys Asn Ser Asp Gly His His Asn Thr Ser Asn Phe Met Asn Leu 
1570 1575 1580 

TTT TTG GAC AAT ATA AGT TTC TGG AAA TTG TTT GGG TTT GAA AAT ATA 4 800 

Phe Leu Asp Asn He Ser Phe Trp Lys Leu Phe Gly Phe Glu Asn He 

•O 1585 1590 1595 1600 



AAT TTT GTA ATC GAT AAA TAC TTT ACC CTT GTT GGT AAA ACT AAT CTT 4 84 8 

Asn Phe Val He Asp Lys Tyr Phe Thr Leu Val Gly Lys Thr Asn Leu 
1605 1610 1615 

GGA TAT GTA GAA TTT ATT TGT GAC AAT AAT AAA AAT ATA GAT ATA TAT 4 8 96 

Gly Tyr Val Glu Phe He Cys Asp Asn Asn Lys Asn He Asp He Tyr 
1620 1625 1630 



4.^ TTT GGT GAA TGG AAA ACA TCG TCA TCT AAA AGC ACT ATA TTT AGC GGA 4 944 

Phe Gly Glu Trp Lys Thr Ser Ser Ser Lys Ser Thr He Phe Ser Gly 
1635 1640 1645 



AAT GGT AGA AAT GTT GTA GTA GAG CCT ATA TAT AAT CCT GAT ACG GGT 4 992 

Asn Gly Arg Asn Val Val Val Glu Pro He Tyr Asn Pro Asp Thr Gly 
1650 1655 1660 



GAA GAT ATA TCT ACT TCA CTA GAT TTT TCC TAT GAA CCT CTC TAT GGA 504 0 

Glu Asp He Ser Thr Ser Leu Asp Phe Ser Tyr Glu Pro Leu Tyr Gly 
."53 1665 1670 1675 1680 



ATA GAT AGA TAT ATA AAT AAA GTA TTG ATA GCA CCT GAT TTA TAT ACA 5088 
He Asp Arg Tyr He Asn Lys Val Leu He Ala Pro Asp Leu Tyr Thr 
1685 1690 1695 

AGT TTA ATA AAT ATT AAT ACC AAT TAT TAT TCA AAT GAG TAC TAC CCT 5136 
Ser Leu He Asn He Asn Thr Asn Tyr Tyr Ser Asn Glu Tyr Tyr Pro 
1700 1705 1710 



6D gag ATT ATA GTT CTT AAC CCA AAT ACA TTC CAC AAA AAA GTA AAT ATA 5184 

Glu Hg He Val Leu Asn Pro Asn Thr Phe His Lys Lys Val Asn He 
1715 1720 1725 
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AAT TTA GAT AGT TCT TCT TTT GAG TAT AAA TGG TCT ACA GAA GGA ACT 
l^eu Asp Ser Ser Ser Phe Glu Tyr Lys Trp Ser Thr Glu Gly Ser 
^^30 1735 1740 

GAC TTT ATT TTA GTT AGA TAC TTA GAA GAA AGT AAT AAA AAA ATA TTA 
Asp Phe He Leu Val Arg Tyr Leu Glu Glu Ser Asn Lys Lys He Leu 
^^^5 1750 1755 1760 

CAA AAA ATA AGA ATC AAA GGT ATC TTA TCT AAT ACT CAA TCA TTT AAT 
Gin Lys He Arg lie Lys Gly He Leu Ser Asn Thr Gin Ser Phe Asn 
I'^^S 1770 1775 

AAA ATG AGT ATA GAT TTT AAA GAT ATT AAA AAA CTA TCA TTA G^A TAT 
Lys Met Ser He Asp Phe Lys Asp He Lys Lys Leu Ser Leu Gly Tyr 
1780 1785 1790 

M*^? c^*^ ir ^ "^^^ ^'^ '^^ AAT GAA TTA GAT AGA 

He Met Ser Asn Phe Lys Ser Phe Asn Ser Glu Asn Glu Leu Asp Arq 
1795 1800 1805 

GAT CAT TTA GGA TTT AAA ATA ATA GAT AAT AAA ACT TAT TAC TAT GAT 
Asp His Leu Gly Phe Lys He He Asp Asn Lys Thr Tyr Tyr Tyr Asp 
lolO 1815 1820 

GAA GAT AGT AAA TTA GTT AAA GGA TTA ATC AAT ATA AAT AAT TCA TTA 
Glu Asp Ser Lys Leu Val Lys Gly Leu He Asn He Asn Asn Ser Leu 
^^^^ Ifi^O 1835 1840 

TTC TAT TTT GAT CCT ATA GAA TTT AAC TTA GTA ACT GGA TGG CAA ACT 
Phe Tyr Phe Asp Pro He Glu Phe Asn Leu Val Thr Gly Trp Gin Thr 
1845 1850 1855 

ATC AAT GGT AAA AAA TAT TAT TTT GAT ATA AAT ACT GGA GCA GCT TTA 
He Asn Gly Lys Lys Tyr Tyr Phe Asp He Asn Thr Gly Ala Ala Leu 

1865 1870 

ACT AGT TAT AAA ATT ATT AAT GGT AAA CAC TTT TAT TTT AAT A/iT GAT 
Thr Ger Tyr Lys He He Asn Gly Lys His Phe Tyr Phe Asn Asn Asp 
1875 1880 1885 

GGT GTG ATG CAG TTG GGA GTA TTT AAA GGA CCT GAT GGA TTT GAA TAT 
^ VoL^^^ ^^"^ Gly Pro Asp Gly Phe Glu Tyr 

1895 1900 

TTT GCA CCT GCC AAT ACT CAA AAT AAT AAC ATA GAA GGT CAG GCT ATA 
, Asn Asn Asn He Glu Gly Gin Ala He 

^^^^ 1910 1915 1920 

GTT TAT CAA AGT AAA TTC TTA ACT TTG AAT GGC AAA AAA TAT TAT TTT 
Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr T-'r Phe 
1925 1930 1935 

GAT AAT AAC TCA AAA GCA GTC ACT GGA TGG AGA ATT ATT AAC AAT GAG 
Asp Asn Asn Ser Lys Ala Val Thr Gly Trp Arg He He Asn Asn Glu 
1940 1945 1950 

AAA TAT TAC TTT AAT CCT AAT AAT GCT ATT GCT GCA GTC GGA TTG CAA 
Lys Tyr Tyr Phe Asn Pro Asn Asn Ala He Ala Ala Val Gly Leu Gin 
1955 1960 1965 

GTA ATT GAC AAT AAT AAG TAT TAT TTC AAT CCT GAC ACT GCT ATC ATC 

"^y^ '^y^ Asn Pro Asp Thr Ala He He 
1^70 1975 1980 

TCA AAA GGT TGG CAG ACT GTT AAT GGT AGT AGA TAC TAC TTT GAT ACT 
Ser Lys Gly Trp Gin Thr Val Asn Gly Ser Arg Tyr Tyr Phe Asp Thr 
^^^^ 1990 1995 *^ 2000 



5232 



5280 



5328 



5376 



5424 



5472 



5520 



5568 



5616 



5664 



5712 



5760 



5808 



5856 



5904 



595: 



6000 
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10 



20 



40 



GAT ACC GCT ATT GCC TTT AAT GGT TAT AAA ACT ATT GAT GGT AAA CAC 604 8 

Asp Thr Ala lie Ala Phe Asn Gly Tyr Lys Thr lie Asp Gly Lys His 
2005 2010 2015 

TTT TAT TTT GAT AGT GAT TGT GTA GTG AAA ATA GGT GTG TTT AGT ACC 6096 

Phe Tyr Phe Asp Ser Asp Cys Val Val Lys lie Gly Val Phe Ser Thr 
2020 2025 2030 

TCT AAT GGA TTT GAA TAT TTT GCA OCT GCT AAT ACT TAT AAT AAT AAC 6144 

Ser Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn 
2035 2040 2045 



ATA GAA GGT CAG GCT ATA GTT TAT CAA AGT AAA TTC TTA ACT TTG AAT 6192 
He Glu Gly Gin Ala He Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn 
2050 2055 2060 



OGT AAA AAA TAT TAC TTT GAT AAT AAC TCA AAA GCA GTT ACC GGA TTG 6240 

Gly Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu 

2065 2070 2075 2080 

CAA ACT ATT GAT AGT AAA AAA TAT TAC TTT AAT ACT AAC ACT GCT GAA 62 88 

Gin Thr lie Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu 
2085 2090 2095 

25 GCA GCT ACT GGA TGG CAA ACT ATT GAT GGT AAA AAA TAT TAC TTT AAT 63 36 

Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn 
2100 2105 2110 

ACT AAC ACT GCT GAA GCA GCT ACT GGA TGG CAA ACT ATT GAT GGT AAA 6384 

M) Thr Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys 

2115 2120 2125 

AAA TAT TAC TTT AAT ACT AAC ACT GCT ATA GCT TCA ACT GGT TAT ACA 64 32 

Lys Tyr Tyr Phe Asn Thr Asn Thr Ala He Ala Ser Thr Gly Tyr Thr 
21 30 2135 2140 



ATT ATT AAT GGT AAA CAT TTT TAT TTT AAT ACT GAT GGT ATT ATG CAG 6480 
He He Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Gly He Met Gin 
2145 2150 2155 2160 

ATA GGA GTG TTT AAA GGA CCT AAT GGA TTT GAA TAT TTT GCA CCT GCT 6528 
He Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala 
2165 2170 2175 

45 AAT ACG GAT GCT AAC AAC ATA GAA GGT CAA GCT ATA CTT TAC CAA AAT 6576 

Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He Leu Tyr Gin Asn 
2180 2185 2190 

GAA TTC TTA ACT TTG AAT GGT AAA AAA TAT TAC TTT GGT AGT GAC TCA 66 24 

^0 Glu Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tvr Phe Gly Ser Asp Ser 
2195 2200 ' 2205 

AAA GCA GTT ACT GGA TGG AGA ATT ATT AAC AAT AAG AAA TAT TAC TTT 6672 
Lys Ala Val Thr Gly Trp Arg He He Asn Asn Lys Lys Tyr Tyr Phe 
^.'> 2210 2215 2220 

AAT CCT AAT AAT GCT ATT GCT GCA ATT CAT CTA TGC ACT ATA AAT AAT 6 720 

Asn Pro Aon Asn Ala He Ala Ala He His Leu Cys Thr lie Asn Asn 
2225 2230 2235 2240 

GAC AAG TAT TAC TTT AGT TAT GAT GGA ATT CTT CAA AAT GGA TAT ATT 6 768 

Asp Lys Tyr Tyr Phe Ser Tyr Asp Gly He Leu Gin Asn Gly Tyr He 
2245 2250 2255 

65 ACT ATT GAA AGA AAT AAT TTC TAT TTT GAT GCT AAT AAT GAA TCT AAA 6816 

Thr He Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu Ser Lys 
2260 2265 2270 
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6960 



7008 



ATG GTA ACA GGA GTA TTT AAA GGA CCT AAT GGA TTT GAG TAT TTT GCA 6864 
Met Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala 
2275 2280 2285 

CCT GCT AAT ACT CAC AAT AAT AAC ATA GAA GGT CAG GOT ATA GTT TAG 6912 
Pro Ala Asn Thr His Asn Asn Asn lie Glu Gly Gin Ala He Val Tyr 
2290 2295 2300 

CAG AAC AAA TTC TTA ACT TTG AAT GGC AAA AAA TAT TAT TTT GAT AAT 
Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp Asn 
2305 2310 2315 2320 

GAC TCA AAA GCA GTT ACT GGA TGG CAA ACC ATT GAT GGT AAA AAA TAT 
Asp Ser Lys Ala Val Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr 
2325 2330 2335 

TAC TTT AAT CTT AAC ACT GCT GAA GCA GCT ACT GGA TGG CAA ACT ATT 7056 
Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He 
2340 2345 2350 

GAT GGT AAA AAA TAT TAC TTT AAT CTT AAC ACT GCT GAA GCA GCT ACT 7104 
Asp Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr 
2355 2360 2365 

GGA TGG CAA ACT ATT GAT GGT AAA AAA TAT TAC TTT AAT ACT AAC ACT 7152 
Gly Trp Gin Thr lie Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr 
-370 2375 2380 

TTC ATA GCC TCA ACT GGT TAT ACA AGT ATT AAT GGT AAA CAT TTT TAT 7200 
Phe lie Ala Ser Thr Gly Tyr Thr Ser He Asn Gly Lys His Phe Tyr 
-385 2390 2395 2400 

TTT A/vT ACT GAT GGT ATT ATG CAG ATA GGA GTG TTT AAA GGA CCT AAT 724 8 

Phe Asn Thr Asp Gly He Met Gin He Gly Val Phe Lys Gly Pro Asn 
2405 2410 2415 

GGA TTT GAA TAC TTT GCA CCT GCT AAT ACG GAT GCT AAC AAC ATA GAA 7296 
Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu 
2420 2425 2430 

GGT CAA GCT ATA CTT TAC CAA AAT AAA TTC TTA ACT TTG AAT GGT AAA 7344 
Gly Gin Ala He Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys 
2435 2440 2445 

AAA TAT TAC TTT GGT AGT GAC TCA AAA GCA GTT ACC GGA CTG CGA ACT 73 92 

Lys Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Ax-g Thr 
-•150 2455 2460 

ATT GAT GGT AAA AAA TAT TAC TTT AAT ACT AAC ACT GCT GTT GCA GTT 744 0 

He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val 
2465 2470 2475 2480 

ACT GGA TGG CAA ACT ATT AAT GGT AAA AAA TAC TAC TTT AAT ACT AAC 7488 
Thr Gly Trp Gin Thr He Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn 
2485 2490 2495 

ACT TCT ATA GCT TCA ACT GGT TAT ACA ATT ATT AGT GGT AAA CAT TTT 7536 
Thr Ser lie Ala Ser Thr Gly Tyr Thr He He Ser Gly Lys His Ph^ 
2500 2505 2510 

TAT TTT AAT ACT GAT GGT ATT ATG CAG ATA GGA GTG TTT AAA GGA CCT 7 584 

Tyr Phe Asn Thr Asp Gly He Met Gin He Gly Val Phe Lys Gly Pro 
2515 2520 2525 

GAT GGA TTT GAA TAC TTT GCA CCT GCT AAT ACA GAT GCT AAC AAT ATA 7632 
Asp Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He 
2530 2535 2540 
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GAA GGT CAA GCT ATA CGT TAT CAA AAT AGA TTC CTA TAT TTA CAT GAC 7680 
Glu Gly Gin Ala He Arg Tyr Gin Asn Arg Phe Leu Tyr Leu His Asp 
2545 2550 2555 2560 

AAT ATA TAT TAT TTT GGT AAT AAT TCA AAA GCG GCT ACT GGT TOG GTA 7728 
Asn He Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val 
2565 2570 2575 

ACT ATT GAT GGT AAT AGA TAT TAC TTC GAG CCT AAT ACA GCT ATG GGT 7776 
Thr He Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly 
2580 2585 2590 

GCG AAT GGT TAT AAA ACT ATT GAT AAT AAA AAT TTT TAC TTT AGA AAT 7824 
Ala Asn Gly Tyr Lys Thr He Asp Asn Lys Asn Phe Tyr Phe Arq Asn 
2595 2600 2605 

GGT TTA CCT CAG ATA GGA GTG TTT AAA GGG TCT AAT GGA TTT GAA TAC 7872 
Gly Leu Pro Gin He Gly Val Phe Lys Gly Ser Asn Gly Phe Glu Tyr 
2610 2615 2620 

TTT OCA CCT GCT AAT ACG GAT GCT AAC AAT ATA GAA GGT CAA GCT ATA 7920 
Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He 
2625 2630 2635 2640 

CCT TAT CAA AAT AGA TTC CTA CAT TTA CTT GGA AAA ATA TAT TAC TTT 7968 
Arg Tyr Gin Asn Arg Phe Leu His Leu Leu Glv Lys He Tyr Tyr Phe 
2645 2650 2655 

GGT AAT AAT TCA AAA GCA GTT ACT GGA TGG CAA ACT ATT AAT GGT AAA 8016 
Gly Asn Asn Ser Lys Ala Val Thr Gly Trp Gin Thr He Asn Gly Lys 
2660 2665 2670 

GTA TAT TAC TTT ATG CCT GAT ACT GCT ATG GCT GCA GCT GGT GGA CTT 8064 
Val Tyr Tyr Phe Mec Pro Asp Thr Ala Met Ala Ala Ala Gly Gly Leu 
2675 2680 2685 

TTC GAG ATT GAT GGT GTT ATA TAT TTC TTT GGT GTT GAT GGA GTA AAA 8112 
Phe Glu He Asp Gly Val He Tyr Phe Phe Gly Val Asp Gly Val Lys 
2690 2695 2700 

GCC CCT GGG ATA TAT GGC TAA 8133 
Ala Pro Gly He Tyr Gly 
2705 2710 

<2) INFORMATION FOR SEO ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Leu He Ser Lys Glu Glu Leu He Lys Leu Ala Tyr Ser He 
15 10 15 

Arg Pro Arg Glu Asn Glu Tyr Lys Thr He Leu Thr Asn Leu Asp Glu 
20 25 30 

Tyr Asn Lys Leu Thr Thr Asn Asn Asn Glu Asn Lys Tyr Leu Gin Leu 

35 40 45 

Lys Lys Leu Asn Glu Ser He Asp Val Phe Met Asn Lys Tyr Lys Thr 
50 55 60 

Ser Ser Arg Asn Arg Ala Leu Ser Asn Leu Lys Lys Asp He Leu Lys 
65 70 75 80 
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Glu Val lie Leu He Lys Asn Ser Asn Thr Ser Pro Val Glu Lys Asn 
85 90 95 



10 



Leu His Phe Val Trp He Gly Gly Glu Val Ser Asp He Ala Leu Glu 
100 105 110 

Tyr He Lys Gin Trp Ala Asp He Asn Ala Glu Tyr Asn He Lys Leu 
115 120 125 

Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala He 
130 135 140 



15 



20 



Val Glu Ser Ser Thr Thr Glu Ala Leu Gin Leu Leu Glu Glu Glu He 
^"^5 150 155 160 

Gin Asn Pro Gin Phe Asp Asn Met Lys Phe Tvr Lys Lys Arg Met Glu 
165 170 ' 175 

. Phe He Tyr Asp Arg Gin Lys Arg Phe He Asn Tyr Tyr Lys Ser Gin 
180 185 190 

He Asn Lys Pro Thr Val Pro Thr He Asp Asp He He Lys Ser His 
195 200 205 



30 



35 



40 



Leu Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tyr Arq 
210 215 220 

Thr Asn Ser Leu Arg Lys He Asn Ser Asn His Glv He Asp He Arg 

230 235 * 240 

Ala Asn Ser Leu Phe Thr Glu Gin Glu Leu Leu Asn He Tyr Ser Gin 
245 250 255 

Glu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp He Val Arg 
260 265 270 

Leu Leu Ala Leu Lys Asn Phe Gly Gly Val Tyr Leu Asp Val Asp Met 
275 280 •'oc 



285 



Leu Pro Gly He His Ser Asp Leu Phe Lys Thr He Ser Arg Pro Sei 
290 295 300 



45 



50 



60 



65 



Ser He Gly Leu Asp Arg Trp Glu Met He Lys Leu Glu Ala He Met 

310 315 320 

Lys Tyr Lys Lys Tyr He Asn Asn Tyr Thr ser Glu Asn Phe Asp Lvs 
325 330 335 

Leu Asp Gin Gin Leu Lys Asp Asn Phe Lys Leu He He Glu Ser Lys 
340 345 350 

Ser Glu Lys Ser Glu He Phe Ser Lys Leu Glu Asn Leu Asn Val Ser 

355 360 365 

Asp Leu Glu He Lys He Ala Phe Ala Leu Gly Ser Val He Asn Gin 
370 375 380 

Ala Leu He Ser Lys Gin Gly Ser Tyr Leu Thr Asn Leu Val He Glu 
385 390 395 4OO 

Gin Val Lys Asn Arg Tyr Gin Phe Leu Asn Gin His Leu Asn Pro Ala 
405 410 415 

He Glu Ser Asp Asn Asn Phe Thr Asp Thr Thr Lys He Phe His Asp 



420 



425 



430 



Ser Leu Phe Asn Ser Ala Thr Ala Glu Asn Ser Met Phe Leu Thr Lys 
435 440 445 
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lie Ala Pro Tyr Leu Gin Val Gly Phe Met Pro Glu Ala Arg Ser Thr 
450 455 

lie Ser Leu Ser Gly Pro Gly Ala Tyr Ala Ser Ala Tyr Tyr Asp Phe 

470 475 480 

lie Asn Leu Gin Glu Asn Thr He Glu Lys Thr Leu Lvs Ala Ser Asp 
485 490 ' 495 

Leu He Glu Phe Lys Phe Pro Glu Asn Asn Leu Ser Gin Leu Thr Glu 
500 505 510 

Gin Glu He Asn Ser Leu Trp Ser Phe Asp Gin Ala Ser Ala Lys Tyr 
515 520 525 

Gin Phe Glu Lys Tyr Val Arg Asp Tyr Thr Gly Gly Ser Leu Ser Glu 
530 535 540 

Asp Asn Gly Val Asp Phe Asn Lys Asn Thr Ala Leu Asp Lys Asn Tyr 

545 550 555 560 

Leu Leu Asn Asn Lys He Pro Ser Asn Asn Val Glu Glu Ala Gly Ser 
565 570 575 

Lys Aon Tyr Val His Tyr He He Gin Leu Gin Gly Asp Asp He Ser 
580 585 590 

Tyr Glu Ala Thr Cys Asn Leu Phe Ser Lys Asn Pro Lys Asn Ser He 
^595 600 605 

He He Gin Arg Asn Met Asn Glu Ser Ala Lys Ser Tyr Phe Leu Ser 
610 615 620 

Asp Asp Gly Glu Ser He Leu Glu Leu Asn Lys Tyr Arg He Pro Glu 
625 630 635 640 

Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr Phe He Gly His Gly 
645 650 655 

Lys Asp Glu Phe Asn Thr Ser Glu Phe Ala Arg Leu Ser Val Asp Ser 
660 665 670 

Leu Ser Asn Glu He Ser Ser Phe Leu Asp Thr He Lvs Leu Aso He 
675 680 685 

G€r Pro Lys Asn VaL Glu Val Asn Leu Leu Giv Cys Asn Met Phe Ser 
690 69S * 700 

Tyr Asp Phe Asn Val <;lu Glu Thr Tvr Pro Gly Lyo Leu Leu Leu Ser 
705 710 715 720 

He Met Asp Lys He Thr Ser Thr Leu Pro Asp Val Asn Lys Asn Ser 

725 730 735 

He Thr He Gly Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly 
740 745 750 

Arg Lys Glu Leu Leu Ala His Ser Glv Lys Trp He Asn Lys Glu Glu 
755 760 765 

Ala He Mfet Ser Asp Leu Ser Ser Lys Glu Tyr He Phe Phe Asp Ser 
770 775 780 

He Asp Asn Lys Leu Lys Ala Lys Ser Lys Asn He Pro Gly Leu Ala 
785 790 795 800 

Ser He Ser Glu Asp He Lys Thr Leu Leu Leu Asp Ala Ser Val Ser 
805 810 815 
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Pro Asp Thr Lys Phe 
820 



He Leu Asn 



Asn Leu Lys Leu Asn He Glu Ser 
825 830 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 



Ser He Gly Asp Tyr He Tyr Tyr Glu Lys Leu Glu Pro Val Lys Asn 
835 840 845 

He He His Asn Ser He Asp Asp Leu He Asp Glu Phe Asn Leu Leu 

850 855 860 

Glu Asn Val Ser Asp Glu Leu Tyr Glu Leu Lys Lys Leu Asn Asn Leu 
865 870 875 880 

Asp Glu Lys Tyr Leu He Ser Phe Glu Asp He Ser Lys Asn Asn Ser 
885 890 895 

Thr Tyr Ser Val Arg Phe He Asn Lys Ser Asn Gly Glu Ser Val Tyr 
900 905 910 

Val Giu Thr Glu Lys Glu He Phe Ser Lys Tyr Ser Glu His He Thr 
915 920 925 

Lys Glu He Ser Thr He Lys Asn Ser He He Thr Asp Val Asn Gly 
930 935 940 

Asn Leu Leu Asp Asn He Gin Leu Asp His Thr Ser Gin Val Asn Thr 
945 950 955 960 

Leu Asn Ala Ala Phe Phe He Gin Ser Leu He Asp Tyr Ser Ser Asn 
965 970 975 

Lys Asp Val Leu Asn Asp Leu Ser Thr Ser Val Lys Val Gin Leu Tyr 
980 985 990 

Ala Gin Leu Phe Ser Thr Gly Leu Asn Thr He Tyr Asp Ser He Gin 
995 1000 1005 

Leu Val Asn Leu He Ser Asn Ala Val Asn Asp Thr He Asn Val Leu 
1010 1015 1020 

Pro Thr He Thr Glu Gly He Pro He Val Ser Thr He Leu Asp Gly 
1025 1030 1035 104C 

He Asn Leu Gly Ala Ala He Lys Glu Leu Leu Asp Glu His Asp Pro 
1045 1050 1055 

Leu Leu Lys Lys Glu Leu Glu Ala Lys Val Gly Val Leu Ala He Asn 
1060 1065 1070 

Met Scr Leu Ser He Ala Ala Thr Val Ala Ser He Val Gly He Gly 
1075 1080 1085 

Ala Glu Val Thr He Phe Leu Leu Pro He Ala Gly He Ser Ala Gly 

1090 1095 1100 

He Pro Ser Leu Val Asn Asn Glu Leu He Leu His Asp Lys Ala Thr 
1105 1110 1115 1120 

Ser Val Val Asn Tyr Phe Asn His Leu Ser Glu Ser Lys Lys Tyr Gly 
1125 1130 1135 

Pro Leu Lys Thr Glu Asp Asp Lys He Leu Val Pro He Asp Asp Leu 
1140 1145 1150 

Val He Ser Glu He Asp Phe Asn Asn Asn Ser He Lys Leu Gly Thr 
1155 1160 1165 

Cys Asn He Leu Ala Met Glu Gly Gly Ser Gly His Thr Val Thr Gly 



1170 



1175 



1180 
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Asn lie Asp His Phe Phe Ser Ser Pro Ser He Ser Ser His He Pro 
^^^^ 1190 1195 1200 

ser Leu Ser He Tyr Ser Ala He Gly He Glu Thr Glu Asn Leu Asp 
1205 1210 1215 

Phe Ser Lys Lys He Met Met Leu Pro Asn Ala Pro Ser Arg Val Phe 
1220 1225 1230 

Trp Trp Glu Thr Gly Ala Val Pro Gly Leu Arg Ser Leu Glu Asn Asp 
1235 1240 1245 

Gly Thr Arg Leu Leu Asp Ser He Arg Asp Leu Tyr Pro Glv Lys Phe 
1-50 1255 1260 

Tyr Trp Arg Phe Tyr Ala Phe Phe Asp Tyr Ala He Thr Thr Leu Lvs 
^^^^ 1270 1275 1280 

Pro Val Tyr Glu Asp Thr Asn He Lys He Lys Leu Asp Lys Asp Thr 
1285 1290 1295 

Arg Asn Phe He Met Pro Thr He Thr Thr Asn Glu He Arq Asn Lys 
1300 1305 i3io 

Leu Ser Tyr Ser Phe Asp Gly Ala Gly Gly Thr Tyr Ser Leu Leu Leu 
1.515 1320 1325 

Ser Ser Tyr Pro He Ser Thr Asn He Asn Leu Ser Lvs Asp Asp Leu 
1330 1335 1340 * 

Trp He Phe Asn He Asp Asn Glu val Arg Glu He Ser He Glu Asn 
^^^^ 1350 1355 1360 

Gly Thr He Lys Lys Gly Lys Leu He Lyo Asp Val Leu Ser Lys He 
1365 1370 1375 

Asp He Asn Lys Asn Lys Leu He He Gly Asn Gin Thr He Asp Phe 
1380 1385 1390 

Ser Gly Asp He Asp Asn Lys Asp Arq Tyr He Phe Leu Thr Cys Glu 
1395 1400 1405 

Leu Asp Asp Lys He Ser Leu He He Glu He Asn Leu Val Ala Lys 
1410 1.115 .420 

Ser Tyr Ser Leu Leu Leu Ser Gly Asp Lys Asn Tvr Leu He Scr Asn 
^^25 1430 1435 ■ 1440 

Leu Ser Asn Thr He Glu Lys He Asn Thr Leu Gly Leu Asp Ser Lys 
1445 1450 1455 

Asn He Ala Tyr Asn Tyr Thr Asp Glu Ser Asn Asn Lys Tyr Phe Gly 
1460 1465 1470 

Ala He Ser Lys Thr Ser Gin Lys Ser He He His Tyr Lys Lys Asp 
1475 1480 1485 

Ser Lys Asn He Leu Glu Phe Tyr Asn Asp Ser Thr Leu Glu Phe Asn 
1490 1495 1500 

Ser Lys A^p Phe He Ala Glu Asp He Asn Val Phe Met Lys Asp Asp 
1505 1510 1515 1520 

He Asn Thr He Thr Gly Lys Tyr Tyr Val Asp Asn Asn Thr Asp Lys 
1525 1530 1535 

Ser He Asp Phe Ser He Ser Leu Val Ser Lys Asn Gin Val Lys Val 
1540 1545 1550 
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Asn Gly Leu Tyr Leu Asn Glu Ser Val Tyr Ser Ser Tyr Leu Asp Phe 
1555 1560 1565 

Val Lys Asn Ser Asp Gly His His Asn Thr Ser Asn Phe Met Asn Leu 
1570 1575 1580 

Phe Leu Asp Asn He Ser Phe Trp Lys Leu Phe Gly Phe Glu Asn ll 
1585 1590 1595 



e 

1600 



Asn Phe Val He Asp Lys Tyr Phe Thr Leu Val Gly Lvs Thr Asn Leu 
1605 1610 ' 1615 

Gly Tyr Val Glu Phe He Cys Asp Asn Asn Lys Asn He Asp lie Tyr 
1620 1625 1630 

Phe Gly Glu Trp Lys Thr Ser Ser Ser Lys Ser Thr He Phe Ser Gly 
1635 1640 1645 

Asn Gly Arg Asn Val Val Val Glu Pro He Tyr Asn Pro Asp Thr Glv 
1650 1655 1660 

Glu Asp He Ser Thr Ser Leu Asp Phe Ser Tyr Glu Pro Leu Tvr Gly 
^^65 1670 1675 * 1680 

He Asp Arg Tyr He Asn Lys Val Leu He Ala Pro Asp Leu Tyr Thr 
1685 1690 1695 

Ser Leu He Asn He Asn Thr Asn Tyr Tyr Ser Asn Glu Tyr Tyr P-o 
1700 1705 1710 

Glu He He Val Leu Asn Pro Asn Thr Phe His Lyo Lys Val Asn He 
1715 1720 1725 

Asn Leu Asp Ser Ser Ser Phe Glu Tyr Lys Trp Ser Thr Glu Glv Ser 
1730 1735 1740 

Asp Phe He Leu Val Arg Tyr Leu Glu Glu Ser Asn Lys Lys He Leu 
^■'^^ 1750 1755 1760 

Gin Lyo He Arg He Lys Gly He Leu Ser Asn Thr Gin Ser Phe Asn 
1765 1770 1775 

Lys Met Ser He Asp Phe Lys Asp He Lys Lys Leu Ser Leu Gly Tyr 
1780 1785 1790 

He Met Ser Asn Phe Lys Ser Phe Asn Ser Glu Asn Glu Leu Asp Arq 
1795 1800 1805 

Asp His Leu Gly Phe Lys He He Asp Asn Lys Thr Tvr Tyr Tyr Asd 
1810 1815 1820 ' ' ^ ^ 

Glu Asp Ser Lys Leu Val Lys Gly Leu He Asn He Asn Asn Ser Leu 
1825 1830 1835 1840 

Phe Tyr Phe Asp Pro He Giu Phe Asn Leu Val Thr Gly Trp Gin Thr 
1845 1850 1855 

He Asn Gly Lys Lys Tyr Tyr Phe Asp He Asn Thr Gly Ala Ala Leu 
I860 1865 1870 

Thr Ser Tyr Lys He He Asn Gly Lys His Phe Tyr Phe Asn Asn Asp 
1875 1880 1885 

Gly Val Met Gin Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr 

1890 looe * 



1695 



1900 



Phe Ala Pro Ala Asn Thr Gin Asn Asn Asn He Glu Gly Gin Ala He 
1^°5 1910 1915 1320 



wo 98/08540 PCTAJS97/1S394 



Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe 
1925 1930 1935 

Asp Asn Asn Ser Lys Ala Val Thr Gly Trp Arg lie He Asn Asn Glu 
^ 1940 1945 1950 

Lys Tyr Tyr Phe Asn Pro Asn Asn Ala He Ala Ala Val Gly Leu Gin 
1955 1950 1965 

10 Val He Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala He He 
1970 1975 1980 



15 



20 



Ser Lys Gly Trp Gin Thr Val Asn Gly Ser Arg Tyr Tyr Phe Asp Thr 
1985 1990 1995 2000 

Asp Thr Ala He Ala Phe Asn Gly Tyr Lys Thr He Asp Gly Lys His 
2005 2010 2015 

Phe Tyr Phe Asp Ser Asp Cys Val Val Lys He Gly Val Phe Ser Thr 
2020 2025 2030 

Ser Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn 
2035 2040 2045 

2.> He Glu Gly Gin Ala He Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn 
2050 2055 2060 

Gly Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu 
2065 2070 2075 2080 

M) 

Gin Thr He Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu 
2085 2090 2095 

Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Acn 
2100 2105 2110 



41) 



45 



60 



Thr Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lvs 
2115 2120 2125 

Lys Tyr Tyr Phe Asn Thr Asn Thr Ala He Ala Ser Thr Giy Tyr Thr 
2130 2135 2140 

He He Asn Gly Lys His Phe Tvr Phe Asn Thr Asp Gly He Met Gin 
21-15 2150 2155 2160 

He Giy val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala 
2165 2170 2175 



Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He Leu Tvr Gin Asn 
^0 2180 2185 2190 

Glu Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Gly Ser Asp Ser 
2195 2200 2205 

Lys Ala Val Thr Gly Trp Arg He He Asn Asn Lys Lys Tyr Tyr Phe 
2210 2215 2220 



Asn Pro Asn Asn Ala He Ala Ala He His Leu Cys Thr He Asn Asn 
2225 2230 2235 2240 

Asp Lys Tyr Tyr Phe Ser Tyr Asp Gly He Leu Gin Asn Gly Tvr He 
2245 2250 2255 



Thr He Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu Ser Lys 
2260 2265 2270 

Met Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala 
2275 2280 2285 
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Pro Ala Asn Thr His Asn Asn Asn lie Glu Gly Gin Ala lie Val Tyr 
2290 2295 2300 

Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp Asn 
2305 2310 2315 2320 

Asp Ser Lys Ala Val Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr 
2325 2330 2335 

Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He 
2340 2345 2350 

Asp Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr 
2355 2360 2365 

Gly Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr 
2370 2375 2380 

Phe He Ala Ser Thr Gly Tyr Thr Ser He Asn Gly Lys His Phe Tvr 
2385 2390 2395 2400 

Phe Asn Thr Asp Gly He Met Gin He Gly Val Phe Lvs Gly Pro Asn 
2405 2410 * 2415 

Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu 
2420 2425 2430 

Gly Gin Ala He Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Gly Lvs 
2435 2440 2445 

Lys Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arq Thr 
2450 2455 2460 

He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val 
2465 2470 2475 2480 

Thr Gly Trp Gin Thr He Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn 
2485 2490 2495 

Thr Ser He Ala Ser Thr Gly Tyr Thr He He Ser Gly Lys His Phe 
2500 2505 2510 

Tyr Phe Asn Thr Asp Gly He Met Gin He Gly Val Phe Lys Gly Pro 
2515 2520 2525 

Asp Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He 
2530 2535 2540 

Glu Gly Gin Ala He Arg Tyr Gin Asn Arg Phe Leu Tyr Leu His Asp 
2545 2550 2555 2560 

Asn He Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val 
2565 2570 2575 

Thr He Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly 
2580 2585 2590 

Ala Asn Gly Tyr Lys Thr He Asp Asn Lys Asn Phe Tvr Phe Arq Asn 
2595 2600 2605 

Gly Leu Pto Gin He Gly Val Phe Lys Gly Ser Asn Gly Phe Glu Tvr 
2610 2615 2620 

Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He 
2625 2630 2635 2640 

Arg Tyr Gin Asn Arg Phe Leu His Leu Leu Gly Lys He Tyr Tyr Phe 
2645 2650 2655 
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Gly Asn Asn Ser Lys Ala Val Thr Gly Trp Gin Thr He Asn Gly Lys 
2660 2665 2670 

Val Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala Ala Gly Gly Leu 
^ 2675 2680 2685 

Phe Glu lie Asp Gly Val He Tyr Phe Phe Gly Val Asp Glv Val Lvs 
2690 2695 2700 

M) Ala Pro Gly He Tyr Gly 
2705 2710 

\2) INFORMATION FOR SEQ ID NO: 7: 

'•^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
ID) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: GEO ID N0:7: 

"^y^ Lys He He Asn Gly Lys His Phe Tyr Phe Asn Asn Asp Gly 

^5 10 15 

Val Met Gin Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr Phe 
20 25 30 

Ala Pro Ala Asn Thr Gin Asn Asn Asn He Glu Gly Gin Ala He Val 

35 40 45 

Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly Lvs Lys Tyr Tvr Phe Asd 
50 55 * 60 ' 

Asn Asn Ser Lys Ala Val Thr Gly Trp Arg He He Asn Asn Glu Lvs 
65 70 75 80 

Tyr Tyr Phe Asn Pro Asn Asn Ala He Ala Ala Val Gly Leu Gin Val 
85 90 95 

He Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala He He Ser 
100 105 110 

Lys Gly Trp Gin Thr Val Asn Gly Ser Arg Tvr Tyr Phe Asp Thr Asp 
115 120 125 

Thr Ala He Ala Phe Asn Gly Tyr Lys Thr He Asp Gly Lys His Phe 
130 135 140 

Tyr Phe Asp Ser Asp Cys Val Val Lys He Gly Val Phe Ser Thr Ser 

150 155 160 

Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn He 
165 170 175 

Glu Gly Gin Ala He Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly 
180 185 190 

Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu Gin 
195 200 205 

. r '^^^ lie Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu Ala 

210 215 220 

Ala Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr 
225 230 235 240 



40 



45 



60 
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Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys Lys 
245 250 255 

Tyr Tyr Phe Asn Thr Asn Thr Ala He Ala Ser Thr Gly Tyr Thr He 
260 265 270 

He Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Gly He Met Gin He 
275 280 285 

Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn 
290 295 300 

Thr Asp Ala Asn Asn He Glu Gly Gin Ala He Leu Tyr Gin Asn Glu 
305 310 315 

Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Gly Ser Asp Ser Lys 
325 330 335 

Ala Val Thr Gly Trp Arg He He Asn Asn Lys Lys Tvr Tyr Phe Asn 
340 345 ' 350 

Pro Asn Asn Ala He Ala Ala He His Leu Cvs Thr He Asn Asn Asp 
355 360 ' 365 

Lys Tyr Tyr Phe Ser Tyr Asp Gly He Leu Gin Asn Gly Tyr He Thr 
370 375 380 

He Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Acn Glu Ser Lys Met 
390 395 400 

Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro 
405 410 415 

Ala Asn Thr His Asn Asn Asn He Glu Gly Gin Ala Ho Val Tyr Gin 
420 425 430 

Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Aso Asn Asp 
435 440 445 

Ser Lys Ala Val Thr Gly Trp Gin Thr He Asp Gly Lys Lvs Tvr Tvr 
450 455 460 

Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He Asp 
4«>5 470 475 480 

Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly 
485 490 495 

Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Phe 
500 505 510 

He Ala Ser Thr Gly Tyr Thr Ser He Asn Gly Lys His Phe Tyr Phe 

515 520 525 

Asn Thr Asp Gly He Met Gin He Gly Val Phe Lys Gly Pro Asn Gly 
530 535 540 

Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu Gly 

545 550 555 560 

Gin Ala He Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lvs 
565 570 575 

Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arg Thr He 
580 585 590 

Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val Thr 
595 600 605 
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Gly Trp Gin Thr He Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr 
610 615 620 

Ser He Ala Ser Thr Gly Tyr Thr He He Ser Gly Lys His Phe Tyr 

630 635 640 

Phe Asn Thr Asp Gly He Met Gin He Gly Val Phe Lys Gly Pro Asp 
645 650 655 

Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu 
660 665 670 

Gly Gin Ala He Arg Tyr Gin Asn Arg Phe Leu Tyr Leu His Asp Asn 
675 680 685 

He Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val Thr 
690 695 700 

He Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly Ala 
■'OS 710 715 720 

Asn Gly Tyr Lys Thr He Asp Asn Lys Asn Phe Tyr Phe Arg Asn Gly 
725 730 735 

Leu Pro Gin He Gly Val Phe Lys Gly Ser Asn Gly Phe Glu Tvr Phe 
740 745 750 * 

Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He Arg 
755 760 765 

Tyr Gin Asn Arg Phe Leu His Leu Leu Gly Lys He Tyr Tyr Phe Gly 
770 775 780 

Asn Asn Ser Lys Ala Val Thr Gly Trp Gin Thr He Asn Gly Lys Val 
■^85 790 795 800 



800 

Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala 
805 810 

:2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

Ui) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ser Tyr Lys He He Asn Gly Lys His Phe Tyr Phe Asn Asn Asp Gly 
IS 10 15 

Val Met Gin Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr Phe 
20 25 30 

Ala Pro Ala Asn Thr Gin Asn Asn Asn He Glu Glv Gin Ala He Val 
35 40 * 45 

Tyr T31n Ser Lys Phe Leu Thr Leu Asn Glv Lys Lvs Tyr Tyr Phe Asd 
50 55 60 

Asn Asn Ser Lyo Ala Val Thr Gly Trp Arg He He Asn Asn Glu Lys 
65 70 75 80 

Tyr Tyr Phe Asn Pro Asn Asn Ala He Ala Ala 
85 90 
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40 



50 



00 



65 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 7101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .7098 



48 



96 



144 



SEQUENCE DESCRIPTION: SEQ ID NO: 9; 

ATG AGT TTA GTT AAT AGA AAA CAG TTA GAA AAA ATG GCA AAT GTA AGA 
Met Ser Leu Val Asn Arg Lys Gin Leu Glu Lys Met Ala Asn Val Arg 
20 15 

TTT CGT ACT CAA GAA GAT GAA TAT GTT GCA ATA TTG GAT GCT TTA GAA 
Phe Arg Thr Gin Glu Asp Glu Tyr Val Ala He Leu Asp Ala Leu Glu 
20 25 30 

25 GAA TAT CAT AAT ATG TCA GAG AAT ACT GTA GTC GAA AAA TAT TTA AAA 

Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lvs Tyr Leu Lys 
^5 40 45 

TTA AAA GAT ATA AAT AGT TTA ACA GAT ATT TAT ATA GAT ACA TAT AAA 190 
M) Leu Lys Asp He Asn Ser Leu Thr Asp He Tyr He Asp Thr Tyr Lys 
^" 55 60 

AAA TCT GGT AGA AAT AAA GCC TTA AAA AAA TTT AAG GAA TAT CTA GTT 240 

■5^ n Phe Lys Glu Tyr Leu Val 

70 75 QQ 

ACA GAA GTA TTA GAG CTA AAG AAT AAT AAT TTA ACT CCA GTT GAG AAA 288 
Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lys 
85 90 95 

AAT TTA CAT TTT GTT TGG ATT GGA GGT CAA ATA AAT GAC ACT GCT ATT 3 36 

Asn Leu His Phe Val Trp He Gly Gly Gin He Asn Asp Thr Ala He 
100 105 110 

4.^ AAT TAT ATA AAT CAA TGG AAA GAT GTA AAT AGT GAT TAT AAT GTT AAT 384 

Asn Tyr He Asn Gin Trp Lys Asp Val Asn Ser Asp Tyr Asn Val Asn 

120 125 

TTT TAT GAT AGT AAT GCA TTT TTG ATA AAC ACA TTG AAA AAA ACT 4 32 

Val Phe Tyr Asp Ser Asn Ala Phe Leu He Asn Thr Leu Lys Lys Thr 
130 135 140 

GTA GTA GAA TCA GCA ATA AAT GAT ACA CTT GAA TCA TTT AGA GAA AAC 480 
Val Val Glu Ser Ala He Asn Asp Thr Leu Glu Ser Phe Arg Glu Asn 
^ ISO 155 160 

TTA AAT GAC CCT AGA TTT GAC TAT AAT AAA TTC TTC AGA AAA CGT ATG 528 
Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met 
165 170 175 

GAA ATA ATT TAT GAT AAA CAG AAA AAT TTC ATA AAC TAC TAT AAA GCT 576 
Glu He He Tyr Asp Lys Gin Lys Asn Phe He Asn Tyr Tyr Lys Ala 
180 185 190 

CAA AGA GAA GAA AAT CCT GAA CTT ATA ATT GAT GAT ATT GTA AAG ACA 6 24 

Gin Arg Glu Glu Asn Pro Glu Leu He He Asp Asp He Val Lys Thr 
195 200 205 
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TAT CTT TCA AAT GAG TAT TCA AAG GAG ATA GAT GAA CTT AAT ACC TAT 672 
Tyr Leu Ser Asn Glu Tyr Ser Lys Glu He Asp Glu Leu Asn Thr Tyr 
210 215 220 

ATT GAA GAA TCC TTA AAT AAA ATT ACA CAG AAT AGT GGA AAT GAT GTT 7 20 

lie Glu Glu Ser Leu Asn Lys He Thr Gin Asn Ser Gly Asn Asp Val 
225 230 235 240 

AGA AAC TTT GAA GAA TTT AAA AAT GGA GAG TCA TTC AAC TTA TAT GAA 768 
Arg Asn Phe Glu Glu Phe Lys Asn Gly Glu Ser Phe Asn Leu Tyr Glu 
245 250 255 

CAA GAG TTG GTA GAA AGG TGG AAT TTA GCT GCT GCT TCT GAC ATA TTA 816 
Gin Glu Leu Val Glu Arg Trp Asn Leu Ala Ala Ala Ser Asp He Leu 
260 265 270 

AGA ATA TCT GGA TTA AAA GAA ATT GGT GGT ATG TAT TTA GAT GTT GAT 864 
Arg He Ser Ala Leu Lys Glu He Gly Gly Met Tyr Leu Asp Val Asd 
275 280 285 

ATG TTA CCA GGA ATA CAA CCA GAC TTA TTT GAG TCT ATA GAG AAA CCT 912 
Met Leu Pro Gly He Gin Pro Asp Leu Phe Glu Ser He Glu Lys Pro 
290 295 300 

AGT TCA GTA ACA GTG GAT TTT TGG GAA ATG ACA AAG TTA GAA GCT ATA 960 
Ser Ger Val Thr Val Asp Phe Trp Glu Met Thr Lvs Leu Glu Ala He 
J05 310 315 * 320 

ATG IxAA TAC AAA GAA TAT ATA CCA GAA TAT ACC TCA GAA CAT TTT GAC 1008 
Met Lys Tyr Lys Glu Tyr He Pro Glu Tyr Thr Ser Glu His Phe Asp 
325 330 335 

ATG TTA GAC GAA GAA GTT CAA AGT AGT TTT GAA TCT GTT CTA GCT TCT 1056 
Met Leu Asp Glu Glu Val Gin Ser Ser Phe Glu Ser Val Leu Ala Ser 
340 345 350 

AAG TCA GAT AAA TCA GAA ATA TTC TCA TCA CTT GGT GAT ATG GAG GCA 1104 
Lys Ser Asp Lys Ser Glu He Phe Ser Ser Leu Gly Asp Met Glu Ala 
355 360 365 

TCA CCA CTA GAA GTT AAA ATT GCA TTT AAT AGT AAG GGT ATT ATA AAT 1152 
Ser Pro Leu Glu Val Lys He Ala Phe Asn Ser Lys Gly He He Asn 
370 375 380 

CAA GGG CTA ATT TCT GTG AAA GAC TCA TAT TGT AGC AAT TTA ATA GTA 1200 
Gin Gly Leu He Ser Val Lys Asp Ser Tyr Cys Ser Asn Leu He Val 
385 390 395 400 

AAA CAA ATC GAG AAT AGA TAT AAA ATA TTG AAT AAT AGT TTA AAT CCA 1248 
Lys Gin He Glu Asn Arg Tyr Lys He Leu Asn Asn Ser Leu Asn Pro 
405 410 415 

GCT ATT AGC GAG GAT AAT GAT TTT AAT ACT ACA ACG AAT ACC TTT ATT 12 96 

Ala He Ser Glu Asp Asn Asp Phe Asn Thr Thr Thr Asn Thr Phe He 
420 425 430 

GAT AGT ATA ATG GCT GAA GCT AAT GCA GAT AAT GGT AGA TTT ATG ATG 1344 
Asp Ser He Met Ala Glu Ala Asn Ala Asp Asn Gly Arg Phe Met Met 
435 440 445 

GAA CTA GGA AAG TAT TTA AGA GTT GGT TTC TTC CCA GAT GTT AAA ACT 13 9:> 

Glu Leu Gly Lys Tyr Leu Arg Val Gly Phe Phe Pro Asp Val Lys Thr 
450 455 460 

ACT ATT AAC TTA AGT GGC CCT GAA GCA TAT GCG GCA GCT TAT CAA GAT 1440 
Thr He Asn Leu Ser Gly Pro Glu Ala Tyr Ala Ala Ala Tyr Gin Asp 
465 470 475 480 
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TTA TTA ATG TTT AAA GAA GGC AGT ATG AAT ATC CAT TTG ATA GAA GCT 1488 
Leu Leu Met Phe Lys Glu Gly Ser Met Asn He His Leu He Glu Ala 
485 490 495 

GAT TTA AGA AAC TTT GAA ATC TCT AAA ACT AAT ATT TCT CAA TCA ACT 1536 
Asp Leu Arg Asn Phe Glu He Ser Lys Thr Asn He Ser Gin Ser Thr 
500 505 510 

GAA CAA GAA ATG GCT AGC TTA TGG TCA TTT GAC GAT GCA AGA GCT AAA 1584 
Glu Gin Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lys 
515 520 525 

GCT CAA TTT GAA GAA TAT AAA AGG AAT TAT TTT GAA GGT TCT CTT GGT 1632 
Ala Gin Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu Gly 
530 535 540 

GAA GAT GAT AAT CTT GAT TTT TCT CAA AAT ATA GTA GTT GAC AAG GAG 1680 
Glu Asp Asp Asn Leu Asp Phe Ser Gin Asn He Val Val Asp Lys Glu 
545 550 555 560 

TAT CTT TTA GAA AAA ATA TCT TCA TTA GCA AGA AGT TCA GAG AGA GGA 1728 
Tyr Leu Leu Glu Lys He iSer Ser Leu Ala Arg Ser Ser Glu Ar-g Gly 
565 570 ' 575 

TAT ATA CAC TAT ATT GTT CAG TTA CAA GGA GAT AAA ATT AGT TAT GAA 1776 
Tyr He His Tyr He Val Gin Leu Gin Gly Asp Lys He Ser Tyr Glu 
580 585 590 

GCA GCA TGT AAC TTA TTT GCA AAG ACT CCT TAT GAT AGT GTA CTG TTT 1824 
Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser Val Leu Phe 
595 600 605 

CAG AAA AAT ATA GAA GAT TCA GAA ATT GCA TAT TAT TAT AAT CCT GGA 1872 
Gin Lys Asn He Glu Asp Ser Glu He Ala Tyr Tyr Tyr Asn Pro Gly 
610 615 620 

GAT GGT GAA ATA CAA GAA ATA GAC AAG TAT AAA ATT CCA AGT ATA ATT 1920 
Asp Gly Glu He Gin Glu He Asp Lys Tyr Lys He Pro Ser He He 
625 630 635 640 

TCT GAT AGA CCT AAG ATT AAA TTA ACA TTT ATT GGT CAT GGT AAA GAT 1968 
Ser Asp Arg Pro Lys He Lys Leu Thr Phe He Gly His Gly Lys Asp 
645 650 655 

GAA TTT AAT ACT GAT ATA TTT GCA GGT TTT GAT GTA GAT TCA TTA TCC 2016 
Glu Phe Asn Thr Asp He Phe Ala Gly Phe Asp Val Asp Ser Leu Ser 
660 665 670 

ACA GAA ATA GAA GCA GCA ATA GAT TTA GCT AAA GAG GAT ATT TCT CCT 2064 
Thr Glu He Glu Ala Ala He Asp Leu Ala Lys Glu Asp He Ser Pro 
675 680 685 

AAG TCA ATA GAA ATA AAT TTA TTA GGA TGT AAT ATG TTT AGC TAC TCT 2112 
Lys Ser He Glu He Asn Leu Leu Gly Cys Asn Met Phe Ser Tyr Ser 
690 695 700 

ATC AAC GTA GAG GAG ACT TAT CCT GGA AAA TTA TTA CTT AAA GTT AAA 2160 
He Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lys Val Lys 
705 710 715 720 

GAT AAA ATA TCA GAA TTA ATG CCA TCT ATA AGT CAA GAC TCT ATT ATA 2 208 

Asp Lys He Ser Glu Leu Met Pro Ser He Ser Gin Asp Ser He He 
725 730 735 

GTA AGT GCA AAT CAA TAT GAA GTT AGA ATA AAT AGT GAA GGA AGA AGA 22 56 

Val Ser Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly Arg Arg 
740 745 750 
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23 52 



2400 



2448 



2496 



2544 



2592 



GAA TTA TTG GAT CAT TCT GGT GAA TGG ATA AAT AAA GAA GAA AGT ATT 
Glu Leu Leu Asp His Ser Gly Glu Trp He Asn Lys Glu Glu Ser lie 
^55 760 765 

ATA AAG GAT ATT TCA TCA AAA GAA TAT ATA TCA TTT AAT CCT AAA GAA 
He Lys Asp He Ser Ser Lys Glu Tyr He Ser Phe Asn Pro Lys Glu 

775 780 

AAT AAA ATT ACA GTA AAA TCT AAA AAT TTA CCT GAG CTA TCT ACA TTA 
Asn Lys He Thr Val Lys Ser Lys Asn Leu Pro Glu Leu Ser Thr Leu 

790 795 BOO 

TTA CAA GAA ATT AGA AAT AAT TCT AAT TCA AGT GAT ATT GAA CTA GAA 
Leu Gin Glu He Arg Asn Asn Ser Asn Ser Ser Asp He Glu Leu Glu 
805 810 815 

GAA AAA GTA ATG TTA ACA GAA TGT GAG ATA AAT GTT ATT TCA AAT ATA 
Glu Lys Val Met Leu Thr Glu Cys Glu He Asn Val He Ser Asn He 
820 825 830 

GAT ACG CAA ATT GTT GAG GAA AGG ATT GAA GAA GCT AAG AAT TTA ACT 
Asp Thr Gin He Val Glu Glu Arg He Glu Glu Ala Lys Asn Leu Thr 
935 840 845 

TCT GAC TCT ATT AAT TAT ATA AAA GAT GAA TTT AAA CTA ATA GAA TCT 
Ser Asp Ser He Asn Tyr He Lys Aop Glu Phe Lyo Leu He Glu Ser 
8S0 855 860 

ATT TCT GAT GCA CTA TGT GAC TTA AAA CAA CAG AAT GAA TTA GAA GAT 2640 
lie Ser Asp Ala Leu Cys Asp Leu Lys Gin Gin Asn Glu Leu Glu Asd 
865 870 875 880 

TCT CAT TTT ATA TCT TTT GAG GAC ATA TCA GAG ACT GAT GAG GGA TTT 2688 
Ser His Phe He Ser Phe Glu Asp He Ser Glu Thr Asp Giu Glv Phe 
885 890 895 

AGT ATA AGA TTT ATT AAT AAA GAA ACT GGA GAA TCT ATA TTT GTA GAA 2736 
Ser He Arg Phe He Asn Lys Glu Thr Gly Glu Ser He Phe Val Glu 
900 905 9X0 

ACT GAA AAA ACA ATA TTC TCT GAA TAT GCT AAT CAT ATA ACT GAA GAG 2784 
Thr Glu Lys Thr He Phe Ser Glu Tyr Ala Asn His He Thr Glu Glu 
915 920 925 

ATT TCT AAG ATA AAA GGT ACT ATA TTT GAT ACT GTA AAT GGT AAG TTA 2832 
He Ser Lys He Lys Gly Thr He Phe Asp Thr Val Asn Gly Lys Leu 
930 935 940 

GTA AAA AAA GTA AAT TTA GAT ACT ACA CAC GAA GTA AAT ACT TTA AAT 2880 
Val Lys Lys Val Asn Leu Asp Thr Thr His Glu Val Asn Thr Leu Asn 

950 955 960 

GCT GCA TTT TTT ATA CAA TCA TTA ATA GAA TAT AAT AGT TCT AAA GAA 2 928 

Ala Ala Phe Phe He Gin Ser Leu He Glu Tyr Asn Ser Ser Lys Glu 
965 970 975 

TCT CTT AGT AAT TTA AGT GTA GCA ATG AAA GTC CAA GTT TAC GCT CAA 9976 
Ser Leu Ser Asn Leu Ser Val Ala Met Lys Val Gin Val Tyr Ala Gin 
980 985 990 

TTA TTT AGT ACT GGT TTA AAT ACT ATT ACA GAT GCA GCC AAA GTT GTT 3024 
Leu Phe Ser Thr Gly Leu Asn Thr He Thr Asp Ala Ala Lys Val Val 
995 1000 1005 
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GAA TTA GTA TCA ACT GCA TTA GAT GAA ACT ATA GAC TTA CTT CCT ACA 3072 
Glu Leu Val Ser Thr Ala Leu Asp Glu Thr lie Asp Leu Leu Pro Thr 
1010 1015 1020 

TTA TCT GAA GGA TTA CCT ATA ATT GCA ACT ATT ATA GAT GGT GTA AGT 3120 
Leu Ser Glu Gly Leu Pro lie He Ala Thr He He Asp Giy Val Ser 
^025 1030 1035 1040 

TTA GGT GCA GCA ATC AAA GAG CTA AGT GAA ACG AGT GAC CCA TTA TTA 3 168 

Leu Gly Ala Ala He Lys Glu Leu Ser Glu Thr Ser Asp Pro Leu Leu 
1045 1050 1055 

AGA CAA GAA ATA GAA GCT AAG ATA GGT ATA ATG GCA GTA AAT TTA ACA 3216 
Arg Gin Glu He Glu Ala Lys He Gly He Met Ala Val Asn Leu Thr 
1060 1065 1070 

ACA GCT ACA ACT GCA ATC ATT ACT TCA TCT TTG GGG ATA GCT AGT GGA 3264 
Thr Ala Thr Thr Ala He He Thr Ser Ser Leu Gly He Ala Ser Gly 
1075 1080 1085 

TTT AGT ATA CTT TTA GTT CCT TTA GCA GGA ATT TCA GCA GGT ATA CCA 3 312 

Phe Ser He Leu Leu Val Pro Leu Ala Gly He Ser Ala Gly He Pro 
1090 1095 1100 

AGC TTA GTA AAC AAT GAA CTT GTA CTT CGA GAT AAG GCA ACA AAG GTT 3 36 0 

Ser Leu Val Asn Asn Glu Leu Val Leu Arg Asp Lys Ala Thr Lys Val 
1105 1110 1115 ij^20 

GTA GAT TAT TTT AAA CAT GTT TCA TTA GTT GAA ACT GAA GGA GTA TTT 3408 
Val Asp Tyr Phe Lys His Val Ser Leu Val Glu Thr Glu Gly Val Phe 
1125 1130 1135 

ACT TTA TTA GAT GAT AAA ATA ATG ATG CCA CAA GAT GAT TTA GTG ATA 34 56 

Thr Leu Leu Asp Asp Lys He Met Met Pro Gin Asp Asp Leu Val He 
1140 1145 USD 

TCA GAA ATA GAT TTT AAT AAT AAT TCA ATA GTT TTA GGT AAA TGT GAA 3 504 

Ser Glu He Asp Phe Asn Asn Asn Ser He Val Leu Glv Lys Cys Glu 

1155 1160 1165 

ATC TGG AGA ATG GAA GGT GGT TCA GGT CAT ACT GTA ACT GAT GAT ATA 3 552 

He Trp Arg Met Glu Gly Gly Ser Gly His Thr Val Thr Asp Asp He 
1170 1175 1180 



4.-^ GAT CAC TTC TTT TCA GCA CCA TCA ATA ACA TAT AGA GAG CCA CAC TTA 

Asp His Phe Phe Ser Ala Pro Ser He Thr Tyr Arg Glu Pro Hxs Leu 
H85 1190 1195 1200 



6^ AGA TAT TTT GCT TTT ATA GCT GAT GCT TTA ATA ACA ACA TTA AAA CCA 

Arg Tyr Phe Ala Phe He Ala Asp Ala Leu He Thr Thr Leu Lys Pro 
1265 1270 1275 i280 



3600 



TCT ATA TAT GAC GTA TTG GAA GTA CAA AAA GAA GAA CTT GAT TTG TCA 3 648 

Ser He Tyr Asp Val Leu Glu Val Gin Lys Glu Glu Leu Asp Leu Ser 
1205 1210 1215 

AAA GAT TTA ATG GTA TTA CCT AAT GCT CCA AAT AGA GTA TTT GCT TGG 36 96 

Lys Asp Leu Met Val Leu Pro Asn Ala Pro Asn Arg Val Phe Ala Trp 
1220 1225 1230 

GAA ACA GGA TGG ACA CCA GGT TTA AGA AGC TTA GAA AAT GAT GGC ACA 3 744 

Glu Thr Gly Trp Thr Pro Gly Leu Arg Ser Leu Glu Asn Asp Gly Thr 
1235 1240 1245 

AAA CTG TTA GAC CGT ATA AGA GAT AAC TAT GAA GGT GAG TTT TAT TGG 3 7 92 

Lys Leu Leu Asp Arg He Arg Asp Asn Tyr Glu Gly Glu Phe Tyr Tro 
1250 1255 1260 



3840 
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10 



20 



AGA TAT GAA GAT ACT AAT ATA AGA ATA AAT TTA GAT AGT AAT ACT AGA 388B 
Aig Tyr Glu Asp Thr Asn He Arg He Asn Leu Asp Ser Asn Thr Arg 
12B5 1290 1295 

AGT TTT ATA GTT CCA ATA ATA ACT ACA GAA TAT ATA AGA GAA AAA TTA 3 936 

Ser Phe lie Val Pro He He Thr Thr Glu Tyr He Arg Glu Lys Leu 
1300 1305 1310 

TCA TAT TCT TTC TAT GGT TCA GGA GGA ACT TAT GCA TTG TCT CTT TCT 3 984 

Ser Tyr Ser Phe Tyr Gly Ser Gly Gly Thr Tyr Ala Leu Ser Leu Ser 
1315 1320 1325 



CAA TAT AAT ATG GGT ATA AAT ATA GAA TTA AGT GAA AGT GAT GTT TGG 4 032 

^ Gin Tyr Asn Met Gly He Asn He Glu Leu Ser Glu Ser Asp Val Tro 
'-^ 1330 1335 1340 

ATT ATA GAT GTT GAT AAT GTT GTG AGA GAT GTA ACT ATA GAA TCT GAT 4080 
He He Asp Val Asp Asn val Val Arg Asp Val Thr He Glu Ser Asp 
^345 1350 1355 1360 

AAA ATT AAA AAA GGT GAT TTA ATA GAA GGT ATT TTA TCT ACA CTA AGT 4128 
Lys He Lys Lys Gly Asp Leu He Glu Gly He Leu Ser Thr Leu Ser 
1365 1370 1375 

ATT GAA GAG AAT AAA ATT ATC TTA AAT AGC CAT GAG ATT AAT TTT TCT 4176 
He Glu Glu Asn Lys He He Leu Asn Ser His Glu He Asn Phe Ser 
1380 1385 1390 

GGT GAG GTA AAT GGA AGT AAT GGA TTT GTT TCT TTA ACA TTT TCA ATT 4 224 

Gly Glu Val Asn Gly Ser Asn Gly Phe Val Ser Leu Thr Phe Ser He 
1395 1400 1405 



35 


TTA 

Leu 


GAA GGA 
Glu Gly 
1410 


ATA 

He 


AAT 
Asn 


GCA 
Ala 


ATT ATA 
He He 
1415 


GAA 

Glu 


GTT 
Val 


GAT 
Asp 


TTA TTA 

Leu Leu 
1420 


TCT 
Ser 


AAA 
Lys 


TCA 
Ser 


4272 


40 


TAT AAA 
Tyr Lys 
1425 


TTA 
Leu 


CTT 

Leu 


ATT 
He 


TCT GGC 
Ser Gly 
1430 


GAA 
Glu 


TTA 

Leu 


AAA 
Lys 


ATA TTG 
He Leu 
1435 


ATG 
Met 


TTA 
Leu 


AAT 
Asn 


TCA 
Ser 
1440 


4320 




AAT 
Asn 


CAT 
His 


ATT 
He 


CAA 
Gin 


CAG AAA 
Gin Lys 
1445 


ATA 
He 


GAT 
Asp 


TAT 

Tyr 


ATA GGA 
He Gly 
1450 


TTC 
Phe 


AAT 
Asn 


AGC 
Ser 


GAA TTA 

Glu Leu 
1455 


4368 


45 


CAG 
Gin 


AAA 
Lys 


AAT 
Asn 


ATA CCA 
He Pro 
1460 


TAT 
Tyr 


AGC 
Ser 


TTT 
Phe 


GTA GAT 
Val Asp 
1465 


AGT 
Ser 


GAA 
Glu 


GGA 
Gly 


AAA GAG 
Lys" Glu 
1470 


AAT 
Asn 


4416 


50 


GGT 
Gly 


TTT 
Phe 


ATT AAT 
He Asn 
1475 


GGT 
Gly 


TCA 
Ser 


ACA 
Thr 


AAA GAA 
Lye Glu 
1480 


GGT 
Gly 


TTA 

Leu 


TTT 
Phe 


GTA TCT 
Val Ser 
1485 


GAA 
Glu 


TTA 
Leu 


4464 


55 


CCT 
Pro 


GAT GTA 
Asp Val 
1490 


GTT 
Val 


CTT 
Leu 


ATA 
He 


AGT AAG 
Ser Lys 
1495 


GTT 
val 


TAT 
Tyr 


ATG 
Met 


GAT GAT 
Asp Asp 
1500 


AGT 
Ser 


AAG 
Lys 


CCT 
Pro 


4512 


60 


TCA TTT 
Ser Phe 
1505 


GGA 
Gly 


TAT 
Tyr 


TAT 
Tyr 


AGT AAT 
Ser Asn 
1510 


AAT 
Asn 


TTG 
Leu 


AAA 
Lys 


GAT GTC 
Asp Val 
1515 


AAA 

Lys 


GTT 

Val 


ATA 

He 


ACT 

Thr 
1520 


4560 




AAA 
Lys 


GAT 
Asp 


AAT 
Asn 


GTT 
Val 


AAT ATA 
Asn He 
1525 


TTA 
Leu 


ACA 
Thr 


GGT 
Gly 


TAT TAT 
Tyr Tyr 
1530 


CTT 
Leu 


AAG 
Lys 


GAT 
Asp 


GAT ATA 
Asp He 
1535 


4608 


65 


AAA 
Lys 


ATC 
He 


TCT 
Ser 


CTT 
Leu 
154C 


TCT 
Ser 

1 


TTG 
Leu 


ACT 
Thr 


CTA 
Leu 


CAA GAT 
Gin Asp 
1545 


GAA 
Glu 


AAA 
Lys 


ACT 
Thr 


ATA AAG 
He Lys 
1550 


TTA 
Leu 


4656 
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AAT AGT GTG CAT TTA GAT GAA AGT GGA GTA GCT GAG ATT TTG AAG TTC 
Asn Ser Val His Leu Asp Glu Ser Gly Val Ala Glu He Leu Lys Phe 
1555 1560 1565 

ATG AAT AGA AAA GGT AAT ACA AAT ACT TCA GAT TCT TTA ATG AGC TTT 
Met Asn Arg Lys Gly Asn Thr Asn Thr Ser Asp Ser Leu Met Ser Phe 
1570 1575 1580 

TTA GAA AGT ATG AAT ATA AAA AGT ATT TTC GTT AAT TTC TTA CAA TCT 
Leu Glu Ser Met Asn He Lys Ser He Phe Val Asn Phe Leu Gin Ser 
^5®^ 1590 1595 1600 

AAT ATT AAG TTT ATA TTA GAT GCT AAT TTT ATA ATA AGT GGT ACT ACT 
Asn He Lys Phe He Leu Asp Ala Asn Phe He He Ser Gly Thr Thr 
1605 1610 1615 

TCT ATT GGC CAA TTT GAG TTT ATT TGT GAT GAA AAT GAT AAT ATA CAA 
ser He Gly Gin Phe Glu Phe He Cys Asp Glu Asn Asp Asn He Gin 
1620 1625 1630 

CCA TAT TTC ATT AAG TTT AAT ACA CTA GAA ACT AAT TAT ACT TTA TAT 
Pro Tyr Phe He Lys Phe Asn Thr Leu Glu Thr Asn Tyr Thr Leu Tvr 
1635 1640 1645 

GTA GGA AAT AGA CAA AAT ATG ATA GTG GAA CCA AAT TAT GAT TTA GAT 
Val Gly Asn Arg Gin Asn Met He Val Glu Pro Asn Tyr Asp Leu Asp 
1650 1655 1660 

GAT TCT GGA GAT ATA TCT TCA ACT GTT ATC AAT TTC TCT CAA AAG TAT 
AGP ser Gly Asp He Ser Ser Thr Val He Asn Phe Ser Gin Lvs Tyr 
^^^5 1670 1675 * 1680 

CTT TAT GGA ATA GAC AGT TGT GTT AAT AAA GTT GTA ATT TCA CCA AAT 
Leu Tyr Gly He Asp Ser Cys Val Asn Lys Val Val He Ser Pro Asn 
1685 1690 1695 

ATT TAT ACA GAT GAA ATA AAT ATA ACG CCT GTA TAT GAA ACA AAT AAT 
He Tyr Thr Asp Glu He Asn He Thr Pro Val Tyr Glu Thr Asn Asn 
1700 1705 1710 

ACT TAT CCA GAA GTT ATT GTA TTA GAT GCA AAT TAT ATA AAT GAA AAA 
Thr Tyr Pro Glu Val He Val Leu Asp Ala Asn Tyr He Aon Glu Lys 
1715 1720 1725 

ATA AAT GTT AAT ATC AAT GAT CTA TCT ATA CGA TAT GTA TGG AGT AAT 
He Asn Val Asn He Asn Asp Leu Ser He Arg Tyr Val Trp Ser Asn 
1730 1735 • 1740 

GAT GGT AAT GAT TTT ATT CTT ATG TCA ACT AGT GAA GAA AAT AAG GTG 
Asp Gly Asn Asp Phe He Leu Met Ser Thr Ser Glu Glu Asn Lys Val 
^"^45 1750 1755 1760 

TCA CAA GTT AAA ATA AGA TTC GTT AAT GTT TTT AAA GAT AAG ACT TTG 
Ser Gin Val Lys He Arg Phe Val Asn Val Phe Lys Asp Lys Thr Leu 
1765 1770 1775 

GCA AAT AAG CTA TCT TTT AAC TTT AGT GAT AAA CAA GAT GTA CCT GTA 
Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp Lys Gin Asp Val Pro Val 
1780 1785 1790 

AGT GAA ATA ATC TTA TCA TTT ACA CCT TCA TAT TAT GAG GAT GGA TTG 
Ser Glu He He Leu Ser Phe Thr pro Ser Tyr Tyr Glu Asp Glv Leu 
1795 1800 1805 

ATT GGC TAT GAT TTG GGT CTA GTT TCT TTA TAT AAT GAG AAA TTT TAT 
He Gly Tyr Asp Leu Gly Leu Val Ser Leu Tyr Asn Glu Lys Phe Tvr 
1810 1815 1820 ' 



4704 



4752 



4800 



4848 



4896 



4944 



4992 



5040 



5088 



5136 



5184 



5232 



5280 



5328 



5376 



5424 



5472 
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ATT AAT AAC TTT GGA ATG ATG GTA TCT GGA TTA ATA TAT ATT AAT GAT 5520 
Ue Asn Asn Phe Gly Met Met Val Ser Gly Leu lie Tyr He Asn Asp 
^^25 1830 1635 1840 

TCA TTA TAT TAT TTT AAA CCA CCA GTA AAT AAT TTG ATA ACT GGA TTT 5568 
Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn Asn Leu He Thr Gly Phe 
1845 1850 1855 

GTG ACT GTA GGC GAT GAT AAA TAG TAC TTT AAT CCA ATT AAT GGT GGA 5616 
Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn Pro He Asn Gly Gly 
I860 1865 1870 

GCT GCT TCA ATT GGA GAG ACA ATA ATT GAT GAC AAA AAT TAT TAT TTC 5664 
Ala Ala Ser lie Gly Glu Thr He He Asp Asp Lys Asn Tyr Tyr Phe 
1875 1880 1885 

AAC CAA AGT GGA GTG TTA CAA ACA GGT GTA TTT AGT ACA GAA GAT GGA 5712 
Asn Gin Ser Gly Val Leu Gin Thr Gly Val Phe Ser Thr Glu AsD Glv 
1890 1895 1900 



TTT AAA 
Phe Lvs 
1905 


TAT 
Tyr 


TTT 
Phe 


GCC 
Ala 


CCA GCT 
Pro Ala 
1910 


AAT 
Asn 


ACA 
Thr 


CTT 
Leu 


GAT GAA 
Asp Glu 
1915 


AAC 
Asn 


CTA 
Leu 


GAA 
Glu 


GGA 
Gly 
1920 


5760 


GAA 
Glu 


GCA 
Ala 


ATT 
He 


GAT 
Asp 


TTT 
Phe 

192! 


ACT 
Thr 


GGA 
Gly 


AAA 
Lys 


TTA 
Leu 


ATT ATT 
He He 
1930 


GAC 
Asp 


GAA 
Glu 


AAT 
Asn 


ATT TAT 
He Tvr 
1935 


5808 


TAT 
Tyr 


TTT 
Phe 


GAT 
Asp 


GAT AAT 
Asp Aon 
1940 


TAT 
Tyr 


AGA 
Arg 


GGA 
Gly 


GCT GTA 
Ala Val 
1945 


GAA 
Glu 


TGG 
Trp 


AAA 
Lys 


GAA TTA 
Glu Leu 
1950 


GAT 
Asp 


5856 


GGT 
Gly 


GAA 
Glu 


ATG CAC 
Met His 
1955 


TAT 
Tyr 


TTT 
Phe 


AGC 
Ser 


CCA GAA 
Pro Glu 
1960 


ACA 
Thr 


GGT AAA 
Gly Lys 


GCT TTT 
Ala Phe 
1965 


AAA 
Lys 


GGT 
Gly 


5904 


CTA 

Leu 


AAT CAA 
Asn Gin 
1970 


ATA 
He 


GGT GAT TAT AAA 
Gly Asp Tyr Lys 
1975 


TAC 
Tyr 


TAT 
Tyr 


TTC 
Phe 


AAT TCT 
Asn Ser 
1980 


GAT 
Asp 


GGA GTT 
Gly Val 


5952 


ATG CAA 
Met Gin 
1985 


AAA 

Lys 


GGA 
Gly 


TTT 
Phe 


GTT AGT 
Val Ser 
1990 


ATA 
He 


AAT 

Asn 


GAT 
Asp 


AAT AAA 
Asn Lys 
1995 


CAC 
His 


TAT 

Tyr 


TTT 
Phe 


GAT 
Asp 
2000 


6000 


GAT 
Asp 


TCT 
Ser 


GGT 
Gly 


GTT 
Val 


ATG AAA 
Met Lys 
2005 


GTA 
Val 


GGT 
Gly 


TAC 
Tyr 


ACT GAA 
Thr Glu 
2010 


ATA 
He 


GAT 
Asp 


GGC 
Gly 


AAG CAT 
Lys His 
2015 


6048 


TTC 
Phe 


TAC 
Tyr 


TTT 
Phe 


GCT GAA 
Ala Glu 
2020 


AAC 
Asn 


GGA 
Gly 


GAA 
Glu 


ATG CAA 
Met Gin 
2025 


ATA 
He 


GGA 
Gly 


GTA 
Val 


TTT AAT 
Phe Asn 
2030 


ACA 
Thr 


6096 


GAA 
Glu 


GAT 
Asp 


GGA TTT 
Gly Phe 
2035 


AAA 
Lys 


TAT 
Tyr 


TTT 
Phe 


GCT CAT 
Ala His 
2040 


CAT 
His 


AAT 
Asn 


GAA 
Glu 


GAT TTA 
Asp Leu 
2045 


GGA 
Gly 


AAT 
Asn 


6144 


GAA 
Glu 


GAA GGT 
Glu Gly 
2050 


GAA 
Glu 


GAA 
Glu 


ATC 
He 


TCA TAT 
Ser Tyr 
2055 


TCT 
Ser 


GGT 
Gly 


ATA 
He 


TTA AAT 
Leu Asn 
2060 


TTC 
Phe 


AAT 
Asn 


AAT 
Asn 


6192 


AAA 

Lys 
2065 


ATT 
He 


TAC 
Tyr 


TAT 
Tyr 


TTT 
Phe 


GAT GAT 
Asp Asp 
2070 


TCA 
Ser 


TTT 
Phe 


ACA 
Thr 


GCT GTA 
Ala Val 
2075 


GTT 
Val 


GGA 
Gly 


TGG 
Trp 


AAA 
Lys 
2080 


6240 


GAT 
Asp 


TTA 
Leu 


GAG 
Glu 


GAT 
Asp 


GGT 
Gly 
2085 


TCA 
Ser 


AAG 
Lys 


TAT 
Tyr 


TAT 
Tyr 


TTT 
Phe 
209C 


GAT GAA 
Asp Glu 

1 


GAT 
Asp 


ACA 
Thr 


GCA GAA 
Ala Glu 
2095 


6288 
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GCA TAT ATA GGT TTG TCA TTA ATA AAT GAT GGT CAA TAT TAT TTT AAT 6336 
Ala Tyr He Gly Leu Ser Leu He Asn Asp Gly Gin Tyr Tyr Phe Asn 
2100 2105 2110 

GAT GAT GGA ATT ATG CAA GTT GGA TTT GTC ACT ATA AAT GAT AAA GTC 6 384 

Asp Asp Gly He Met Gin Val Gly Phe Val Thr He Asn Asp Lys Val 
2115 2120 2125 

TTC TAG TTC TCT GAC TCT GGA ATT ATA GAA TCT GGA GTA CAA AAC ATA 6432 
Phe Tyr Phe Ser Asp Ser Gly He He Glu Ser Gly Val Gin Asn He 
2130 2135 2140 

GAT GAC AAT TAT TTC TAT ATA GAT GAT AAT GGT ATA GTT CAA ATT GGT 64 80 

Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn Gly He Val Gin He Gly 
2145 2150 2155 2160 

GTA TTT GAT ACT TCA GAT GGA TAT AAA TAT TTT GCA CCT GCT AAT ACT 6528 
Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe Ala Pro Ala Asn Thr 
2165 2170 2175 

GTA AAT GAT AAT ATT TAG GGA CAA GCA GTT GAA TAT AGT GGT TTA GTT 6576 
Val Asn Asp Asn He Tyr Gly Gin Ala Val Glu Tyr Ser Gly Leu Val 
2180 2185 2190 

AGA GTT GGG GAA GAT GTA TAT TAT TTT GGA GAA ACA TAT ACA ATT GAG 6624 
Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu Thr Tyr Thr He Glu 
2195 2200 2205 

ACT GGA TGG ATA TAT GAT ATG GAA AAT GAA AGT GAT AAA TAT TAT TTC 6672 
Thr Gly Trp He Tyr Asp Met Glu Asn Glu Ser Asp Lys Tyr Tyr Phe 
2210 2215 2220 

AAT CCA GAA ACT AAA AAA GCA TGC AAA GGT ATT AAT TTA ATT GAT GAT 6720 
Asn Pro Glu Thr Lys Lys Ala Cys Lys Gly He Asn Leu He Asp Asp 
2225 2230 2235 2240 

ATA AAA TAT TAT TTT GAT GAG AAG GGC ATA ATG AGA ACG GGT CTT ATA 6 768 

He Lys Tyr Tyr Phe Asp Glu Lys Gly He Met Arg Thr Gly Leu He 
2245 2250 2255 

TCA TTT GAA AAT AAT AAT TAT TAC TTT AAT GAG AAT GGT GAA ATG CAA 6816 
Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu Asn Gly Glu Met Gin 
2260 2265 2270 

TTT GGT TAT ATA AAT ATA GAA GAT AAG ATG TTC TAT TTT GGT GAA GAT 6864 
Phe Gly Tyr He Asn He Glu Asp Lys Met Phe Tyr Phe Gly Glu Asp 
2275 2280 2285 

GGT GTC ATG CAG ATT GGA GTA TTT AAT ACA CCA GAT GGA TTT AAA TAC 6912 
Gly Val Met Gin He Gly Val Phe Asn Thr Pro Asp Gly Phe Lys Tyr 
2290 2295 2300 

TTT GCA CAT CAA AAT ACT TTG GAT GAG AAT TTT GAG GGA GAA TCA ATA 6960 
Phe Ala His Gin Asn Thr Leu Asp Glu Asn Phe Glu Gly Glu Ser He 
2305 2310 2315 2320 

AAC TAT ACT GGT TGG TTA GAT TTA GAT GAA AAG AGA TAT TAT TTT ACA 7008 
Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys Arg Tyr Tyr Phe Thr 
2325 2330 2335 

GAT GAA TAT ATT GCA GCA ACT GGT TCA GTT ATT ATT GAT GGT GAG GAG 7056 
Asp Glu Tyr He Ala Ala Thr Gly Ser Val He He Asp Gly Glu Glu 
2340 2345 2350 

TAT TAT TTT GAT CCT GAT ACA GCT CAA TTA GTG ATT AGT GAA 7098 
Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu Val He Ser Glu 
2355 2360 2365 



TAG 



7101 
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(2) INFORMATION FOR SEQ ID NO: 10; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2366 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Leu Val Asn Arg Lys Gin Leu Glu Lys Met Ala Asn Vdl Arg 
^5 10 15 

Phe Arg Thr Gin Glu Asp Glu Tyr Val Ala lie Leu Asp Ala Leu Glu 

20 25 30 

Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lys Tyr Leu Lys 
35 40 45 ' 

Leu Lys Asp lie Asn Ser Leu Thr Asp He Tyr He Asp Thr Tyr Lvs 

50 55 6 0 

Lys Ser Gly Arg Asn Lys Ala Leu Lys Lys Phe Lys Glu Tyr Leu Val 
^5 70 75 80 

Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lys 
85 90 95 

Asn Leu His Phe Val Trp He Gly Gly Gin He Asn Asp Thr Ala He 
100 105 110 

Asn Tyr He Asn Gin Trp Lys Asp Val Asn Ser Asp Tyr Asn Val Asn 
115 120 125 

Val Phe Tyr Asp Ser Asn Ala Phe Leu He Asn Thr Leu Lys Lvs Thr 
130 135 140 

Val Val Glu Ser Ala He Asn Asp Thr Leu Glu Ser Phe Arg Glu Asn 

150 155 160 

Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met 
165 170 175 

Glu He He Tyr Asp Lys Gin Lys Asn Phe He Asn Tyr Tyr Lys Ala 
180 185 190 

Gin Arg Glu Glu Asn Pro Glu Leu He He Asp Asp He Val Lys Thr 
195 200 205 

Tyr Leu Ser Asn Glu Tyr Ser Lys Glu He Asp Glu Leu Asn Thr Tyr 
210 215 220 

He Glu Glu Ser Leu Asn Lys He Thr Gin Asn Ser Gly Asn Asp Val 

230 235 240 

Arg Asn Phe Glu Glu Phe Lys Asn Gly Glu Ser Phe Asn Leu Tyr Glu 
245 250 255 

Gin Glu Leu Val Glu Arg Trp Asn Leu Ala Ala Ala Ser Asp He Leu 

260 265 270 

Arg He Ser Ala Leu Lys Glu He Gly Gly Met Tyr Leu Asp Val Asp 
275 280 285 

Met Leu Pro Gly He Gin Pro Asp Leu Phe Glu Ser He Glu Lys Pro 
290 295 300 

Ser Ser Val Thr Val Asp Phe Trp Glu Met Thr Lys Leu Glu Ala He 
305 310 3X5 320 
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Met Lys Tyr Lys Glu Tyr He Pro Glu Tyr Thr Ser Glu His Phe Asp 
325 330 335 

Met Leu Asp Glu Glu Val Gin Ser Ser Phe Glu Ser Val Leu Ala Ser 
340 345 350 

Lys Ser Asp Lys Ser Glu He Phe Ser Ser Leu Gly Asp Met Glu Ala 
355 360 365 

Ser Pro Leu Glu Val Lys He Ala Phe Asn Ser Lys Gly He He Asn 
370 375 380 

Gin Gly Leu He Ser Val Lys Asp Ser Tyr Cys Ser Asn Leu lie Val 
385 390 395 400 

Lys Gin He Glu Asn Arg Tyr Lys He Leu Asn Asn Ser Leu Asn Pro 
405 410 415 

Ala He Ser Glu Asp Asn Asp Phe Asn Thr Thr Thr Asn Thr Phe He 
420 425 430 

Asp Ser He Met Ala Glu Ala Asn Ala Asp Asn Gly Arg Phe Met Met 
435 440 445 

Glu Leu Gly Lys Tyr Leu Arg Val Gly Phe Phe Pro Asp Val Lys Thr 
450 455 460 

Thr He Asn Leu Ser Gly Pro Glu Ala Tyr Ala Ala Ala Tyr Gin Asp 
465 470 475 480 

Leu Leu Met Phe Lys Glu Gly Ser Met Asn He His Leu He Glu Ala 
485 490 495 

Asp Leu Arg Asn Phe Glu He Ser Lys Thr Asn He Ser Gin Ser Thr 
500 505 510 

Glu Gin Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lys 
515 520 525 

Ala Gin Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu Gly 
530 535 540 

Glu Asp Asp Asn Leu Asp Phe Ser Gin Asn He Val Val Asp Lys Glu 
545 550 555 560 

Tyr Leu Leu Glu Lys He Ser Ser Leu Ala Arg Ser Ser Glu Arg Gly 

565 570 575 

Tyr lie His Tyr He Val Gin Leu Gin Gly Asp Lys He Ser Tyr Glu 
580 585 590 

Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser Val Leu Phe 

595 600 605 

Gin Lys Asn He Glu Asp Ser Glu He Ala Tyr Tyr Tyr Asn Pro Gly 
610 615 620 

Asp Gly Glu He Gin Glu He Asp Lys Tyr Lys He Pro Ser He He 
625 630 635 640 

Ser Asp AVg Pro Lys He Lys Leu Thr Phe He Gly His Gly Lvs Asp 
645 650 655 

Glu Phe Asn Thr Asp He Phe Ala Gly Phe Asp Val Asp Ser Leu Ser 
660 665 670 

Thr Glu He Glu Ala Ala He Asp Leu Ala Lys Glu Asp He Ser Pro 
675 680 685 
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Lys Ser He Glu He Asn Leu Leu Gly Cys Asn Met Phe Ser Tyr Ser 

690 695 700 

He Asn val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lys Val Lys 
705 710 715 720 

Asp Lys He Ser Glu Leu Met Pro Ser He Ser Gin Asp Ser He He 

725 730 735 

Val Ser Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly Arg Arg 
740 745 750 

Glu Leu Leu Asp His Ser Gly Glu Trp He Asn Lys Glu Glu Ser He 

755 760 765 

He Lys Asp He Ser Ser Lys Glu Tyr He Ser Phe Asn Pro Lys Glu 
770 775 780 

Asn Lys He Thr Val Lys Ser Lys Asn Leu Pro Glu Leu Ser Thr Leu 
'^^^ 790 795 800 

Leu Gin Giu He Arg Asn Asn Ser Asn Ser Ser Asp He Glu Leu Glu 
805 810 815 

Glu Lys Val Met Leu Thr Glu Cys Glu He Asn Val He Ser Asn He 
820 825 830 

Asp Thr Gin He Val Giu Glu Arg He Glu Glu Ala Lvs Asn Leu Thr 
835 840 845 

Ser Asp Ser He Asn Tyr He Lys Asp Glu Phe Lys Leu He Glu Ser 
850 855 860 

He Ser Asp Ala Leu Cys Asp Leu Lys Gin Gin Asn Giu Leu Glu Asp 
865 870 875 880 

Ser His Phe He Ser Phe Glu Asp He Ser Glu Thr Asp Glu Gly Phe 
885 890 895 

Ser He Arg Phe He Aon Lys Glu Thr Gly Glu Ser He Phe Val Glu 
900 905 910 

Thr Glu Lys Thr He Phe Ser Glu Tyr Ala Asn His He Thr Glu Glu 
915 920 925 

He Scr Lys He Lys Gly Thr He Phe Asp Thr Val Asn Gly Lys Leu 
930 935 940 

Val Lys Lys Val Asn Leu Asp Thr Thr His Glu Val Asn Thr Leu Asn 
945 950 955 960 

Ala Ala Phe Phe He Gin Ser Leu He Glu Tyr Asn Ser Ser Lys Glu 
965 970 975 

Ser Leu Ser Asn Leu Ser Val Ala Met Lys Val Gin Val Tyr Ala Gin 
980 985 990 

Leu Phe Ser Thr Gly Leu Asn Thr He Thr Asp Ala Ala Lys Val Val 
995 1000 1005 

Glu Leu val Ser Thr Ala Leu Asp Glu Thr He Asp Leu Leu Pro Thr 
1010 1015 1020 

Leu Ser Glu Gly Leu Pro He He Ala Thr He He Asp Gly Val Ser 
1025 1030 1035 1040 

Leu Gly Ala Ala He Lys Glu Leu Ser Glu Thr Ser Asp Pro Leu Leu 
1045 1050 1055 
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Arg Gin Glu He Glu Ala Lys He Gly He Met Ala Val Asn Leu Thr 
1060 1065 1070 

Thr Ala Thr Thr Ala He He Thr Ser Ser Leu Gly He Ala Ser Gly 
1075 1080 1085 

Phe Ser He Leu Leu Val Pro Leu Ala Gly He Ser Ala Gly He Pro 
1090 1095 1100 

f?oc^®" ^^"3 Asp Lys Ala Thr Lys Val 

^1^^ 1110 1115 1120 

Val Asp Tyr Phe Lys His Val Ser Leu Val Glu Thr Glu Gly Val Phe 
1125 1130 1135 

Thr Leu Leu Asp Asp Lys He Met Met Pro Gin Asp Asp Leu Val He 
1140 1145 1150 

Ser Glu He Asp Phe Asn Asn Asn Ser He Val Leu Gly Lys Cys Glu 
1155 1160 1165 

He Trp Arg Met Glu Gly Gly Ser Gly His Thr Val Thr Asp Asd He 
1170 1175 1180 

Asp His Phe Phe Ser Ala Pro Ser He Thr Tyr Arg Glu Pro His Leu 
^1^^ 1190 1195 1200 

Ser He Tyr Asp Val Leu Glu Val Gin Lys Glu Glu Leu Asp Leu Ser 
1205 1210 1215 

Lys Asp Leu Met Val Leu Pro Asn Ala Pro Asn Arg Val Phe Ala Trp 
1220 1225 1230 

Glu Thr Gly Trp Thr Pro Gly Leu Arg Ser Leu Glu Asn Asp Glv Thr 
1235 1240 1245 

Lys Leu Leu Asp Arg He Arg Asp Asn Tyr Glu Gly Glu Phe Tyr Tro 
1250 1255 1260 

Arg Tyr Phe Ala Phe He Ala Asp Ala Leu He Thr Thr Leu Lys Pro 
^2^^ 1270 1275 1280 

Arg Tyr Glu Asp Thr Asn He Arg He Asn Leu Asp Ser Asn Thr Arg 
1285 1290 1295 

Ser Phe He Val Pro He He Thr Thr Glu Tyr He Arg Glu Lys Leu 
1300 1305 1310 

Ser Tyr Ser Phe Tyr Gly Ser Gly Gly Thr Tyr Ala Leu Ser Leu Ser 
1315 1320 1325 

Gin Tyr Asn Met Gly He Asn He Glu Leu Ser Glu Ser Asp Val Tro 
1330 1335 

He He Asp Val Asp Asn Val Val Arg Asp Val Thr He Glu Ser Asp 
^^^^ 1350 1355 1360 

Lys He Lys Lys Gly Asp Leu He Glu Gly He Leu Ser Thr Leu ser 
1365 1370 1375 

He Glu Glu Asn Lys He He Leu Asn Ser His Glu He Asn Phe Ser 
1380 1385 1390 

Gly Glu Val Asn Gly Ser Asn Gly Phe Val Ser Leu Thr Phe Ser He 
1395 1400 1405 

Leu Glu Gly He Asn Ala He He Glu Val Asp Leu Leu Ser Lys Ser 
1410 1415 1420 
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Tyr Lys Leu Leu lie Ser Gly Glu Leu Lys lie Leu Met Leu Asn Set 
1425 1430 1435 1440 

Asn His He Gin Gin Lys He Asp Tyr He Gly Phe Asn Ser Glu Leu 
1445 1450 1455 

Gin Lys Asn He Pro Tyr Ser Phe Val Asp Ser Glu Gly Lys Glu Asn 
1460 1465 1470 

Gly Phe He Asn Gly Ser Thr Lys Glu Gly Leu Phe Val Ser Glu Leu 
1475 1480 1485 

Pro Asp val Val Leu He Ser Lys Val Tyr Met Asp Asp Ser Lys Pro 
1490 1495 1500 

Ser Phe Gly Tyr Tyr Ser Asn Asn Leu Lys Asp Val Lys Val He Thr 
1505 1510 1515 1520 

Lys Asp Asn Val Asn He Leu Thr Gly Tyr Tyr Leu Lys Asp Asp He 
1525 1530 1535 

Lys He Ser Leu Ser Leu Thr Leu Gin Asp Glu Lys Thr He Lys Leu 
1540 1545 X550 

Asn Ser Val His Leu Asp Glu Ser Gly Val Ala Glu He Leu Lys Phe 
1555 1560 1565 

Met Asn Arg Lys Gly Asn Thr Asn Thr Ser Asp Ser Leu Met Ser Phe 
1570 1575 1580 

Leu Glu Ser Met Asn He Lys Ser He Phe Val Aon Phe Leu Gin Ser 
1585 1590 1595 1600 

Asn He Lys Phe He Leu Asp Ala Asn Phe He He Ser Gly Thr Thr 
1605 1610 1615 

Ser He Gly Gin Phe Glu Phe He Cys Asp Glu Asn Asp Asn He Gin 
1620 1625 1630 

Pro Tyr Phe He Lys Phe Asn Thr Leu Glu Thr Asn Tyr Thr Leu Tyr 
1635 1640 1645 

Val Gly Asn Arg Gin Asn Met He Val Glu Pro Asn Tyr Asp Leu Asp 
1650 1655 1660 

Asp Ser Gly Asp He Ser Ser Thr Val He Asn Phe Ser Gin Lys Tyr 
1665 1670 1675 1680 

Leu Tyr Gly He Asp Ser Cys Val Asn Lys Val Val He Ser Pro Asn 
1685 1690 1695 

He Tyr Thr Asp Glu He Asn He Thr Pro Val Tyr Glu Thr Asn Asn 
1700 1705 1710 

Thr Tyr Pro Glu Val He Val Leu Asp Ala Asn Tyr He Asn Glu Lys 
1715 1720 1725 

He Asn Val Asn He Asn Asp Leu Ser He Arg Tyr Val Trp Ser Asn 
1730 1735 1740 

Asp Gly ASn Asp Phe He Leu Met Ser Thr Ser Glu Glu Asn Lys Val 
1745 1750 1755 1760 

Ser Gin Val Lys He Arg Phe Val Asn Val Phe Lys Asp Lys Thr Leu 
1765 1770 1775 

Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp Lys Gin Asp Val Pro Val 
1780 1785 1790 
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Ser Glu lie lie Leu Ser Phe Thr Pro Ser Tyr Tyr Glu Asp Gly Leu 
1795 1800 1805 

He Gly Tyr Asp Leu Gly Leu Val Ser Leu Tyr Asn Glu Lvs Phe Tyr 
1810 1815 1820 

He Asn Asn Phe Gly Met Mec Val Ser Gly Leu He Tyr He Asn Asp 
1825 1830 1835 1840 

Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn Asn Leu He Thr Gly Phe 
1845 1850 1855 

Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn Pro He Asn Gly Gly 
I860 1865 1870 

Ala Ala Ser He Gly Glu Thr He He Asp Asp Lys Asn Tyr Tyr Ph'=» 
1875 1880 1885 

Asn Gin Ser Gly Val Leu Gin Thr Gly Val Phe Ser Thr Glu Asp Gly 
1890 1895 1900 

Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp Glu Asn Leu Glu Gly 
1905 1910 1915 1920 

Glu Ala He Asp Phe Thr Gly Lys Leu He He Asp Glu Asn He Tyr 
1925 1930 1935 

Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val Glu Trp Lys Glu Leu Asn 
1940 1945 1950 

Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly Lys Ala Phe Lvs Glv 
1955 I960 1965 

Leu Asn Gin He Gly Asp Tyr Lys Tyr Tyr Phe Asn Ser Asp Gly Val 
1970 1975 1980 

Met Gin Lys Gly Phe Val Ser He Asn Asp Asn Lys His Tyr Phe Asp 
1985 1990 1995 2000 

40 Asp Ser Gly Val Met Lys Val Gly Tyr Thr Glu He Asp Gly Lys His 

2005 2010 2015 

Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin He Gly Val Phe Asn Thr 
2020 2025 2030 

Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn Glu Asp Leu Glv Asn 
2035 2040 204S 

Glu Glu Gly Glu Glu He Ser Tyr Ser Gly He Leu Asn Phe Asn Asn 
J" 2050 2055 2060 

Lys He Tyr Tyr Phe Asp Asp Ser Phe Thr Ala Val Val Gly Trp Lvs 
2065 2070 2075 2080 

55 Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asp Glu Asp Thr Ala Glu 

2085 2090 2095 



45 



60 



Ala Tyr He Gly Leu Ser Leu He Asn Asp Gly Gin Tyr Tyr Phe Asn 
2100 2105 2110 

Asp Asp Gly He Met Gin Val Gly Phe Val Thr II « Asn Asp Lvs Vai 
2115 2120 2125 

Phe Tyr Phe Ser Asp Ser Gly I}e He Glu Ser Gly Val Gin Asn lie 
2130 21?.5 2140 

Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn Gly He Val Gin He Gly 
2145 2150 2155 2160 

70 Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe Ala Pro Ala Asn Thr 
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2165 2170 2175 

Val Asn Asp Asn He Tyr Gly Gin Ala Val Glu Tyr Ser Gly Leu Val 
5 ^^9° 2185 2190 

Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu Thr Tyr Thr lie Glu 
2195 2200 2205 

10 '^^'^ ^^^J""^ ^^"^ Ser Asp Lys Tyr Tyr Phe 

-^^-LU 2215 2220 

Asn Pro Giu Thr Lys J-VS Ala Cys Lys Gly He Asn ieu He Asp Asp 
'""^ 2230 2235 2240 

15 lie Lys Tyr Tyr Phe Asp Glu Lys Gly lie Met Arg Thr Gly Leu He 

2245 2250 2255 

Ser Phe Giu Asn Asn Asn Tyr Tyr Phe Asn Glu Asn Gly Glu Met Gin 
20 2265 2270 

Phe Gly Tyr He Asn He Glu Asp Lys Met Phe Tyr Phe Gly Glu Asp 
"275 2280 2285 

^^L"^^ ^^"^ Thr Pro Asp Gly Phe Lys Tyr 

-290 2295 2300 

Phe Ala His Gin Asn Thr Leu Asp Glu Asn Phe Giu Gly Giu Ser He 
2310 2315 2320 

Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys Arg Tyr Tyr Phe Thr 
2325 2330 2335 

Asp Glu Tyr He Ala Ala Thr Gly Ser Val He He Asp Glv Glu Glu 
2340 2345 2350 

Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu Val He Ser Glu 
2355 2360 2365 

(2) INFORMATION FOR SEQ ID NO: 11: 

4u 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
*C) STRANDEDNESS : Single 
<n) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TAGAAAAAAT GGCAAATGT 

(2) INFORMATION FOR SEQ ID NO: 12: 

55 (i) SEQUENCE CHARACTERISTICS: 

lA) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
65 TTTCATCTTG TAGAGTCAAA G 

(2) INFORMATION FOR SEQ ID N0:13: 



30 



3,^ 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 



256 



WO98A)8540 



PCT/US97/15394 



(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GATGCCACAA GATGATTTAG TG 

(2) INFORMATION FOR SEQ ID NO; 14: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CTAATTGAGC TGTATCAGGA TC 

(2) INFORMATION FOR SEQ ID NO: IS: 

(it SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IS: 

CGGAATTCCT AGAAAAAATG GCAAATG 

(2) INFORMATION FOR SEQ ID NO: 16: 

< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GCTCTAGAAT GACCATAAGC TAG CCA 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 
IC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGGAATTCGA GTTGGTAGAA AGGTGGA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

^ tii) MOLECULE TYPE: DNA (genomic) 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CGGAATTCGG TTATTATCTT AAGGATG 27 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
'■^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



30 



40 



4S 



Hi) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGGAATTCTT GATAACTGGA TTTGTGAC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; unknown 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Leu He Thr Gly Phe Vai Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn 
Is 10 



15 



Pro He Asn Gly Gly Ala Ala Ser He Gly Glu Thr He He Asp Asp 
20 25 30 

Lys Asn Tyr Tyr Phe Asn Gin Ser Gly Val Leu Gin Thr Gly Val Phe 
35 40 45 

Ser Thr Glu Asp Gly Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp 
50 55 60 

^■^^ lie Asp Phe Thr Gly Lys Leu He He 

-"65 70 7 5 80 

Asp Glu Asn He Tyr Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val Glu 
85 90 95 

"T^P ^ys Glu Leu Asp Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly 
100 105 110 

Lys Ala Phe Lys Gly Leu Asn Gin He Gly Asp Tyr Lys Tyr Tvr Phe 

^25 

Asn Ser Asp Gly Val Met Gin Lys Gly Phe Val Ser He Asn Asp Asn 
130 135 140 

^ys His Tyr Phe Asp Asp Ser Gly Val Met Lys Val Gly Tyr Thr Glu 
145 150 155 160 

He Asp Gly Lys His Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin He 
165 170 175 

Gly Val Phe Asn Thr Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn 



28 
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180 185 190 

Glu Asp Leu Gly Asn Glu Glu Gly Glu Glu He Ser Tyr Ser Gly He 
195 200 205 

Leu Asn Phe Asn Asn Lys He Tyr Tyr Phe Asp Asp Ser Phe Thr Ala 
210 215 220 

in Y^^ Trp Lys Asp Leu Glu Asp Glv Ser Lys Tyr Tyr Phe Asp 

' ' 225 230 235 240 

Glu Asp Thr Ala Glu Ala Tyr lie Gly Leu Ser Leu He Asn Asp Glv 
245 25r 255 

Gin Tyr Tyr Phe Asn Asp Asp Gly He Met Gin Val Gly Phe Val Thr 
260 265 270 

He Asn Asp Lys Val Phe Tyr Phe Ser Asp Ser Gly He He Glu Ser 
275 280 285 

Gly Val Gin Asn He Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn Gly 
*90 295 300 

He Val Gin He Gly Vai Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe 

310 315 320 

Ala Pro Ala Asn Thr Val Asn Asp Asn He Tyr Gly Gin Ala Val Glu 
325 330 

Tyr Ser Gly Leu Val Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu 
340 345 

Thr Tyr Thr He Glu Thr Gly Trp He Tyr Asp Met Glu Asn Glu Ser 
355 360 365 

Asp Lys Tyr Tyr Phe Asn Pro Glu Thr Lys Lys Ala Cyc Lys Glv He 
370 375 380 

Asn Leu He Asp Asp He Lys Tyr Tyr Phe Asp Glu Lys Gly He Met 
385 390 395 400 

Arg Thr Gly Leu He Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu 
405 410 415 

Asn Gly Glu Met Gin Phe Gly Tyr He Asn He Glu Asp Lvs Mcr Phe 
420 425 430 

Tyr Phe Gly Glu Asp Gly Val Met Gin He Gly Val Phe Asn Thr Pro 

435 440 445 

Asp Gly Phe Lys Tyr Phe Ala His Gin Asn Thr Leu Asp Glu Asn Phe 
450 455 460 

Glu Gly Glu Ser He Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys 
"^^^ 470 475 480 

Arg Tyr Tyr Phe Thr Asp Glu Tyr He Ala Ala Thr Gly Ser Val He 
485 490 495 

He Asp Gly Glu Glu Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu 
500 505 510 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 amino acids 

(B) TYPE: amino acid 

IC) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ser Glu Glu Asn Lys Val Ser Gin Val Lys lie Arg Phe Val Asn Val 
15 10 15 

Phe Lys Asp Lys Thr Leu Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp 
20 25 30 

Lys Gin Asp Val Pro Val Ser Glu lie He Leu Ser Phe Thr Pro Ser 
35 40 45 

Tyr Tyr Glu Asp Gly Leu He Gly Tyr Asp Leu Gly Leu Val Ser Leu 
50 55 60 

Tyr Asn Glu Lys Phe Tyr He Asn Asn Phe Gly Met Met Val Ser Gly 
65 70 75 80 

Leu He Tyr He Asn Asp Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn 
85 90 95 

Asn Leu He Thr Gly Phe Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe 
100 105 110 



Asn Pro He Asn Gly Gly Ala Ala Ser He Gly Glu Thr He He Asp 
•^^"^ 115 120 125 



Asp Lys Asn Tyr Tyr Phe Asn Gin Ser Gly Val Leu Gin Thr Gly Val 
130 135 140 

Phe Ser Thr Glu Asp Gly Phe Lys Tyr Phe Ala Pro Ala Aon Thr Leu 
145 150 155 160 

Asp Glu Asn Leu Glu Gly Glu Ala He Asp Phe Thr Gly Lys Leu He 
165 170 175 

He Asp Glu Asn He Tyr Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val 

180 185 190 



Glu Trp Lys Glu Leu Asp Gly Glu Met His Tyr Phe Ser Pro Glu Thr 
195 200 2 05 

Gly Lys Ala Phe Lys Gly Leu Asn Gin He Gly Asp Tyr Lvs Tyr Tvr 

210 215 220 

55 Phe Asn Ser Asp Gly Val Met Gin Lys Gly Phe Val Ser He Asn Asp 

225 230 235 240 



Asn Lys His Tyr Phe Asp Asp Ser Gly Val Met Lys Val Gly Tyr Thr 
245 250 255 

Glu He Asp Gly Lys His Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin 
260 265 270 



He Gly Val Phe Asn Thr Glu Asp Gly Phe Lys Tyr Phe Ala His His 
6^ 275 280 285 

Asn Glu Asp Leu Gly Asn Glu Glu Gly Glu Glu He Ser Tyr Ser Gly 
290 295 300 

70 He Leu Asn Phe Asn Asn Lys He Tyr Tyr Phe Asp Asp Ser Phe Thr 
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305 310 315 320 

Ala Val Val Gly Trp Lys Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe 
325 330 335 

Asp Glu Asp Thr Ala Glu Ala Tyr He Gly Leu Ser Leu He Asn Asp 
340 345 350 

Gly Gin Tyr Tyr Phe Asn Asp Asp Gly He Met Gin Val Gly Phe Val 
355 360 365 

Thr He Asn Asp Lys Val Phe Tyr Phe Ser Asp Ser Gly He He Glu 
370 375 380 

Ser Gly Val Gin Asn He Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn 
385 390 395 400 

Gly He Val Gin He Gly Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr 
405 410 415 

Phe Ala Pro Ala Asn Thr Val Asn Asp Asn He Tyr Gly Gin Ala Val 
420 425 430 

Glu Tyr Ser Gly Leu Val Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly 
435 440 445 

Glu Tnr Tyr Thr He Glu Thr Gly Trp He Tyr Asp Met Glu Asn Glu 
450 455 460 

Ser Asp Lys Tyr Tyr Phe Asn Pro Glu Thr Lys Lys Ala Cys Lys Gly 
465 470 475 480 

He Asn Leu He Asp Asp He Lys Tyr Tyr Phe Asp Glu Lys Gly He 
485 490 495 

Met Arg Thr Gly Leu He Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn 
500 505 510 

Glu Asn Gly Glu Met Gin Phe Gly Tyr He Asn He Glu Asp Lys Met 
515 520 525 

Phe Tyr Phe Gly Glu Asp Gly Val Met Gin He Gly Val Phe Asn Thr 
530 535 540 

Pro Asp Gly Phe Lys Tyr Phe Ala His Gin Aon Thr Leu Asp Glu Asn 
545 550 555 560 

Phe Glu Gly Glu Ser He Asn Tyr Thr Glv Trp Leu Asp Leu Asp Glu 

565 570 575 

Lys Arg Tyr Tyr Phe Thr Asp Glu Tyr He Ala Ala Thr Gly Ser Val 
580 585 590 

He He Asp Gly Glu Glu Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu 
595 600 605 

(2) INFORMATION FOR SEQ ID NO: 22: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . .1314 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

ATG GCT CGT CTG CTG TCT ACC TTC ACT GAA TAC ATC AAG AAC ATC ATC 4 8 

Met Ala Arg Leu Leu Ser Thr Phe Thr Glu Tyr He Lys Asn He He 
^5 10 15 

AAT ACC TCC ATC CTG AAC CTG CGC TAC GAA TCC AAT CAC CTG ATC GAC 96 
Asn Thr Ser He Leu Asn Leu Arg Tyr Glu Ser Asn His Leu He Asp 
20 25 30 

CTG TCT CGC TAC GCT TCC AAA ATC AAC ATC GGT TCT AAA GTT AAC TTC 144 
Leu Ser Arg Tyr Ala Ser Lys He Asn He Gly Ser Lys Val Asn Phe 
35 40 45 



1^ GAT CCG ATC GAC AAG AAT CAG ATC CAG CTG TTC AAT CTG GAA TCT TCC 

Asp Pro He Asp Lys Asn Gin He Gin Leu Phe Asn Leu Glu Ser Ser 
50 55 60 



192 



AAA ATC GAA GTT ATC CTG AAG AAT GCT ATC GTA TAC AAC TCT ATG TAC 240 

Lys He Glu Val He Leu Lys Asn Ala He Val Tyr Asn Ser Met Tyr 

65 70 75 80 

GAA AAC TTC TCC ACC TCC TTC TGG ATC CGT ATC CCG AAA TAC TTC AAC 288 

Glu Asn Phe Ser Thr Ser Phe Trp He Arg He Pro Lys Tyr Phe Asn 

85 90 95 

TCC ATC TCT CTG AAC AAT GAA TAC ACC ATC ATC AAC TCC ATG GAA AAC 3 36 

Ser lie Ser Leu Asn Asn Glu Tyr Thr He He Asn Cys Met Glu Asn 

100 105 110 

AAT TCT GGT TGG AAA GTA TCT CTG AAC TAC GGT GAA ATC ATC TGG ACT 384 

Asn Ser Gly Trp Lys Val Ser Leu Asn Tyr Gly Glu He He Trp Thr 

115 120 125 



CTG CAG GAC ACT CAG GAA ATC AAA CAG CGT GTT GTA TTC AAA TAC TCT 4 32 

Leu Gin Asp Thr Gin Glu He Lys Gin Arg Val Val Phe Lys Tyr Ser 
130 135 140 



CAG ATG ATC AAC ATC TCT GAC TAC ATC AAT CGC TGG ATC TTC GTT ACC 4 80 

Gin Met He Asn He Ser Asp Tyr He Asn Arg Trp He Phe Val Thr 
145 150 155 160 



ATC ACC AAC AAT CGT CTG AAT AAC TCC AAA ATC TAC ATC AAC GGC CGT 528 
He Thr Asn Asn Arg Leu Asn Asn Ser Lys He Tyr He Asn Gly Arg 
'♦•'^ 165 170 175 



CTG ATC GAC CAG AAA CCG ATC TCC AAT CTG GGT AAC ATC CAC GCT TCT 576 
Leu He Asp Gin Lys Pro He Ser Asn Leu Gly Asn He His Ala Ser 
180 185 190 

AAT AAC ATC ATG TTC AAA CTG GAC GGT TGT CGT GAC ACT CAC CGC TAC 624 
Asn Asn He Met Phe Lys Leu Asp Gly Cys Arg Asp Thr His Arg Tyr 
195 200 205 



:>:> ATC TGG ATC AAA TAC TTC AAT CTG TTC GAC AAA GAA CTG AAC GAA AAA 672 

He Trp lie Lys Tyr Phe Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys 

210 215 220 



GAA ATC AAA GAC CTG TAC GAC AAC CAG TCC AAT TCT GGT ATC CTG AAA 720 
Glu He Lys Asp Leu Tyr Asp Asn Gin Ser Asn Ser Gly He Leu Lyn 
225 • 230 235 240 



GAC TTC TGG GGT GAC TAC CTG CAG TAC GAC AAA CCG TAC TAC ATG CTG 768 
Asp Phe Trp Gly Asp Tyr Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu 

245 250 255 



AAT CTG TAC GAT CCG AAC AAA TAC GTT GAC GTC AAC AAT GTA GGT ATC 816 
Asn Leu Tyr Asp Pro Asn Lys Tyr Val Asp Val Asn Asn Val Gly He 
260 265 270 
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CGC GGT TAC ATG TAC CTG AAA GGT CCG CGT GGT TCT GTT ATG ACT ACC 864 
Arg Gly Tyr Met Tyr Leu Lys Gly Pro Arg Gly Ser Val Met Thr Thr 
275 280 285 

AAC ATC TAC CTG AAC TCT TCC CTG TAC CGT GGT ACC AAA TTC ATC ATC 912 
Asn He Tyr Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lys Phe He He 
290 295 300 

AAG AAA TAC GCG TCT GGT AAC AAG GAC AAT ATC GTT CGC AAC AAT GAT 960 
Lys Lys Tyr Ala Ser Gly Asn Lys Asp Asn He Val Arg Asn Asn Asp 
305 310 315 320 

CGT GTA TAC ATC AAT GTT GTA GTT AAG AAC AAA GAA TAC CGT CTG GCT 1008 
Arg Val Tyr He Asn Val Val Val Lys Asn Lys Glu Tyr Arg Leu Ala 
325 330 335 

ACC AAT GCT TCT CAG GCT GGT GTA GAA AAG ATC TTG TCT GCT CTG GAA 1056 
Thr Asn Ala Ser Gin Ala Gly Val Glu Lys He Leu Ser Ala Leu Glu 
340 345 350 

ATC CCG GAC GTT GGT AAT CTG TCT CAG GTA GTT GTA ATG AAA TCC AAG 1104 
He Pro Asp Val Gly Asn Leu Ser Gin Val Val Val Met Lys Ser Lys 
355 360 365 



2^ AAC GAC CAG GGT ATC ACT AAC AAA TGC AAA ATG AAT CTG CAG GAC AAC 

Asn Asp Gin Gly He Thr Asn Lva Cys Lys Met Asn Leu Gin Asp Asn 
370 375 380 



70 



Lys He Glu Val He Leu Lys Asn Ala He Val Tyr Asn Ser Met Tvr 
65 70 75 80 



1152 



AAT GGT AAC GAT ATC GGT TTC ATC GGT TTC CAC CAG TTC AAC AAT ATC 1200 
Asn Gly Asn Asp He Gly Phe He Gly Phe His Gin Phe Asn Asn lie 
385 390 395 400 



GCT AAA CTG GTT GCT TCC AAC TGG TAC AAT CGT CAG ATC GAA CGT TCC 124 8 

Ala Lys Leu Val Ala Ser Asn Trp Tyr Asn Arg Gin He Glu Arg ser 
405 410 415 



TCT CGC ACT CTG GGT TGC TCT TGG GAG TTC ATC CCG GTT GAT GAC GGT 12 96 

Ser Arg Thr Leu Gly Cys Ser Trp Glu Phe He Pro Val Asp Asp Glv 
420 425 430 

TGG GGT GAA CGT CCG CTG TAACCCGGGA AAGCTT 133 0 

Trp Giy Glu Arg Pro Leu 
435 

45 (2) INFORMATION FOR SEO ID NO; 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 8 amino acids 

(B) TYPE: amino acid 
^0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala Arg Leu Leu Ser Thr Phe Thr Glu Tyr He Lys Asn He He 
15 10 15 

Asn Thr Ser He Leu Asn Leu Arg Tyr Glu Ser Asn His Leu He Asp 
"(J 20 25 30 

Leu Ser Arg Tyr Ala Ser Lys He Asn He Gly Ser Lys Val Asn Phe 
35 40 45 

65 Asp Pro He Asp Lys Asn Gin He Gin Leu Phe Asn Leu Glu Ser Ser 
50 55 60 



- 263 - 



WO98/08S40 



PCT/US97/15394 



Glu Asn Phe Ser Thr Ser Phe Trp He Arg He Pro Lys Tyr Phe Asn 

85 90 95 

Ser He Ser Leu Asn Asn Glu Tyr Thr He He Asn Cys Met Glu Asn 
100 105 110 

Asn Ser Gly Trp Lys Val Ser Leu Asn Tyr Gly Glu He He Trp Thr 
115 120 125 

Leu Gin Asp Thr Gin Glu He Lys Gin Arg Val Val Phe Lys Tyr Ser 

135 140 

Gin Met He Asn He Ser Asp Tyr He Asn Arg Trp He Phe Val Thr 

155 160 

He Thr Asn Asn Arg Leu Asn Asn Ser Lys He Tyr He Asn Gly Arq 

170 175 

Leu He Asp Gin Lys Pro He Ser Asn Leu Gly Asn He His Ala Ser 
ISO 185 190 

Asn Asn He Met Phe Lys Leu Asp Gly Cys Arg Asp Thr His Arg Tyr 
195 200 

He Trp He Lys Tyr Phe Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys 
*.1CJ 215 



0 



Glu He Lys Asp Leu Tyr Asp Asn Gin Ser Asn Ser Gly lie Leu Lys 
""•^ 230 235 240 

Asp Phe Trp Gly Asp Tyr Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu 
245 250 255 

Asn Leu Tyr Asp Pro Asn Lys Tyr Val Asp Val Asn Asn Val Gly He 
260 265 270 

Arg Gly Tyr Met Tyr Leu Lys Gly Pro Arg Gly Ser Val Met Thr Thr 
275 280 285 

Asn He Tyr Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lvs Phe He He 
290 295 300 ' 

Lys Lys Tyr Ala Ser Gly Asn Lys Asp Asn He Val Arg Asn Asn Asp 
•^05 310 315 320 

Arg Val Tyr He Asn Val Val Val Lys Asn Lys Glu Tyr Arc Leu Ala 
325 330 335 

Thr Asn Ala Ser Gin Ala Gly Val Glu Lys He Leu Ser Ala Leu Glu 
340 345 350 

He Pro Asp Val Gly Asn Leu Ser Gin Val Val Val Met Lys Ser Lvs 
355 360 365 

Asn Asp Gin Gly He Thr Asn Lys Cys Lys Met Asn Leu Gin Asp Asn 
3'0 375 380 

Asn Gly Asn Asp He Gly Phe He Gly Phe His Gin Phe Asn Asn He 

390 395 400 

Ala Lys Lfeu Val Ala Ser Asn Trp Tyr Asn Arg Gin He Glu Arg Ser 
405 410 415 

Ser Arg Thr Leu Gly Cys Ser Trp Glu Phe He Pro Val Asp Asn Glv 
420 425 430 

Trp Gly Glu Arg Pro Leu 
435 

(2) INFORMATION FOR SEQ ID NO: 24: 
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10 



15 



30 



3^ 



40 



50 



60 



65 



70 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
fD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
15 10 15 



He Glu Gly Arg His Met Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 25: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1402 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA (genomic) 



iix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

ATG GGC CAT CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT 4 8 

Met Gly His His His His His His His His His His Ser Ser Gly H^s 
is 10 15 

ATC GAA GGT CGT CAT ATG GCT AGC ATG GCT CGT CTG CTG TCT ACC T^C 96 
He Glu Gly Arg His Met Ala Ser Met Ala Arg Leu Leu Ser Thr Phe 
20 25 30 

ACT GAA TAC ATC AAG AAC ATC ATC AAT ACC TCC ATC CTG AAC CTG CGC 144 
Thr Glu Tyr He Lys Asn He He Asn Thr Ser He Leu Asn Leu Arg 
35 40 45 

45 TAC GAA TCC AAT CAC CTG ATC GAC CTG TCT CGC TAC GCT TCC AAA ATC 192 

Tyr Glu Ser Asn Hie Leu He Asp Leu Ser Arg Tyr Ala Ser Lys He 
50 55 60 

AAC ATC GGT TCT AAA GTT AAC TTC GAT CCG ATC GAC AAG AAT CAG ATC 240 
Asn He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He 
^5 70 75 80 

CAG CTG TTC AAT CTG GAA TCT TCC AAA ATC GAA GTT ATC CTG AAG AAT 288 
Gin Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu Lys Asn 

85 90 95 



GCT ATC GTA TAC AAC TCT ATG TAC GAA AAC TTC TCC ACC TCC TTC TGG 336 
Ala He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trn 

100 105 110 

ATC CGT ATC CCG AAA TAC TTC AAC TCC ATC TCT CTG AAC AAT GAA TAC 384 
He Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tvr 
115 120 125 

ACC ATC ATC AAC TGC ATG GAA AAC AAT TCT GGT TGG AAA GTA TCT CTG 4 32 

Thr He He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu 
130 135 

AAC TAC GGT GAA ATC ATC TGG ACT CTG CAG GAC ACT CAG GAA ATC AAA 480 
Asn Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu He Lys 

- 265 - 
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150 155 160 

CAG CGT GTT GTA TTC AAA TAG TCT CAG ATG ATC AAC ATC TCT GAC TAC 
Gin Arg Val Val Phe Lys Tyr Ser Gin Met He Asn He Ser Asp Tyr 
165 170 175 

ATC AAT CGC TGG ATC TTC GTT ACC ATC ACC AAC AAT CGT CTG AAT AAC 
He Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu Asn Asn 
180 185 190 

TCC AAA ATC TAC ATC AAC GGC CGT CTG ATC GAC CAG AAA CCG ATC TCC 
Ser Lys He Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro lie Ser 
195 200 205 

AAT CTG GGT AAC ATC CAC GCT TCT AAT AAC ATC ATG TTC AAA CTG GAC 
Asn Leu Gly Asn He His Ala Ser Asn Asn He Met Phe Lys Leu Asd 
-10 215 220 

GGT TGT CGT GAC ACT CAC CGC TAC ATC TGG ATC AAA TAC TTC AAT CTG 
Gly Cys Arg Asp Thr His Arg Tyr He Trp He Lys Tyr Phe Asn Leu 
-25 230 235 240 

TTC GAC AAA GAA CTG AAC GAA AAA GAA ATC AAA GAC CTG TAC GAC AAC 
Phe Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr Asp Asn 
245 250 255 

CAC TCC AAT TCT GGT ATC CTG AAA GAC TTC TGG GGT GAC TAC CTG CAG 
Gin Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin 
260 265 270 

TAC GAC AAA CCG TAC TAC ATG CTG AAT CTG TAC GAT CCG AAC AAA TAC 
Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr 
275 280 285 

GTT CAC GTC AAC AAT GTA GGT ATC CGC GGT TAC ATG TAC CTG AAA GGT 
Val Asp Val Asn Asn Val Gly He Arg Gly Tyr Met Tyr Leu Lys Gly 
290 295 300 

CCG CGT GGT TCT GTT ATG ACT ACC AAC ATC TAC CTG AAC TCT TCC CTG 
Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser Ser Leu 
305 310 315 320 

TAC CGT GGT ACC AAA TTC ATC ATC AAG AAA TAC GCG TCT GGT AAC AAG 
Tyr Arg Gly Thr Lys Phe Ho He Lys Lys Tvr Ala Ser Gly Asn Lys 
325 330 335 

GAC AAT ATC GTT CGC AAC AAT GAT CGT GTA TAC ATC AAT GTT GTA GTT 
Asp Asn He Val Arg Asn Asn Asp Arg Val Tyr He Asn Val Val Val 
340 345 350 

AAG AAC AAA GAA TAC CGT CTG GCT ACC AAT GCT TCT CAG GCT GGT GTA 
Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala Gly Val 
355 360 365 

GAA AAG ATC TTG TCT GCT CTG GAA ATC CCG GAC GTT GGT AAT CTG TCT 
Glu Lys He Leu Ser Ala Leu Glu He Pro Asp Val Gly Asn Leu Ser 
370 375 380 

CAG GTA GTT GTA ATG AAA TCC AAG AAC GAC CAG GGT ATC ACT AAC AAA 
Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Glv He Thr Asn Lys 
385 • 390 395 * . 400 

TGC AAA ATG AAT CTG CAG GAC AAC AAT GGT AAC GAT ATC GGT TTC ATC 
Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Gly Phe He 
405 410 415 
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C3GT TTC CAC CAG TTC AAC AAT ATC GCT AAA CTG GTT GCT TCC AAC TGG 1296 
Gly Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser Asn Trp 
420 425 430 

TAC AAT CGT CAG ATC GAA CGT TCC TCT CGC ACT CTG GGT TGC TCT TGG 1344 
Tyr Asn Arg Gin He Glu Arg Ser Ser Arg Thr Leu Gly Cys Ser Trp 
435 440 445 

GAG TTC ATC CCG GTT GAT GAC GGT TGG GGT GAA CGT CCG CTG 1386 
Glu Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
450 455 460 



TAACCCGGGA AAGCTT 

(2) INFORMATION FOR SEQ ID NO: 26; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 26: 

Met Giy His His His His His His His His His His Ser Ser Gly His 
15 10 15 

He Glu Gly Arg His Met Ala Ser Met Ala Arg Leu Leu Ser Thr Phe 
20 25 30 

Thr Glu Tyr He Lys Asn He He Asn Thr Ser He Leu Asn Leu Arq 
35 40 45 

Tyr Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Scr Lys He 
50 55 60 

Asn He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He 
^5 70 75 80 

Gin Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu Lys Asn 
85 90 95 

Ala He Val Tyr Asn Ser Met Tyr Glu Asn Phe Sor Thr Scr Phe Trp 
100 105 110 

He Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tyr 
lis 120 125 

Thr He He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu 
130 135 140 

Asn Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu He Lys 
145 150 155 160 

Gin Arg Val Val Phe Lys Tyr Ser Gin Met He Asn He Ser Asp Tvr 

165 170 175 

He Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu Asn Asn 
, 180 185 190 



140: 
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15 



30 



40 



45 



Ser Lys He Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro He Ser 
195 200 205 

Asn Leu Gly Asn He His Ala Ser Asn Asn He Met Phe Lys Leu Asp 
210 215 220 

Gly Cys Arg Asp Thr His Arg Tyr He Trp He Lys Tyr Phe Asn Leu 
225 230 235 240 

Phe Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr Asp Asn 
245 250 255 

Gin Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin 

260 265 270 

Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tvr 
275 280 285 

Val Asp Val Asn Asn Val Gly He Arg Gly Tyr Met Tyr Leu Lys Gly 

290 295 300 

Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser Ser Leu 

310 315 320 

Tvr Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly Asn Lys 
325 330 335 

Asp Asn lie Val Arg Asn Asn Asp Arg Val Tyr He Asn Val Val Val 
340 345 350 

Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala Gly Val 
355 360 365 

Glu Lys He Leu Ser Ala Leu Glu He Pro Asp Val Glv Asn Leu Ser 
370 375 380 

Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Gly He Thr Asn Lvs 
385 390 395 400 

Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Gly Phe He 
405 410 415 

Gly Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser Asn Trp 
420 425 430 

Tyr Asn Arg Gin He Glu Arg Ser Ser Arg Thr Leu Gly Cyj Ser Trp 
435 440 445 



. Glu Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 

•'^O 450 455 460 

i2) INFORMATION FOR SEO ID NO: 27; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

W> (ii) MOLECULE TYPE: DNA (genomic) 
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96 



144 



240 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .3888 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27; 

ATG CAA TTT GTT AAT AAA CAA TTT AAT TAT AAA GAT CCT GTA AAT GGT 43 
Met Gin Phe Val Asn Lys Gin Phe Asn Tvr Lys Asp Pro Val Asn Gly 
^5 10 15 

GTT GAT ATT GCT TAT ATA AAA ATT CCA AAT GTA GGA CAA ATG CAA CCA 
Val Asp He Ala Tyr He Lys He Pro Asn Val Gly Gin Mec Gin Pro 
20 25 30 

GTA AAA GCT TTT AAA ATT CAT AAT AAA ATA TGG GTT ATT CCA GAA AGA 
Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro Glu Arq 
35 40 45 ^ 

GAT ACA TTT ACA AAT CCT GAA GAA GGA GAT TTA AAT CCA CCA CCA GAA i9-> 
Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu 

55 60 

GCA AAA CAA GTT CCA GTT TCA TAT TAT GAT TCA ACA TAT TTA AGT ACA 
Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr 

70 75 80 

GAT AAT GAA AAA GAT AAT TAT TTA AAG GGA GTT ACA AAA TTA TTT GAG 288 
Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu 
85 90 95 

AGA ATT TAT TCA ACT GAT CTT GGA AGA ATG TTG TTA ACA TCA ATA GTA 336 
Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser He Val 
100 105 110 

AGG GGA ATA CCA TTT TGG GGT GGA AGT ACA ATA GAT ACA GAA TTA AAA 384 
Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu Lys 
115 120 125 

GTT ATT GAT ACT AAT TGT ATT AAT GTG ATA CAA CCA GAT GGT AGT TAT 432 
Val He Asp Thr Asn Cys He Asn Val He Gin Pro Asp Gly Ser Tyr 
1^0 135 14Q 

AGA TCA GAA GAA CTT AAT CTA GTA ATA ATA GGA CCC TCA GCT GAT ATT 4 80 

Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala Asd He 
^"^^ 150 155 160 

ATA CAG TTT GAA TGT AAA AGC TTT GGA CAT GAA GTT TTG AAT CTT ACG 528 
He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr 
165 170 175 

CGA AAT GGT TAT GGC TCT ACT CAA TAC ATT AGA TTT AGC CCA GAT TTT 576 
Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe 
180 185 ;l90 

ACA TTT GGT TTT GAG GAG TCA CTT GAA GTT GAT ACA AAT CCT CTT TTA 624 
Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu 
195 200 205 

GGT GCA GGC AAA TTT GCT ACA GAT CCA GCA GTA ACA TTA GCA CAT GAA 67-> 
Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu 
210 215 220 

CTT ATA CAT GCT GGA CAT AGA TTA TAT GGA ATA GCA ATT AAT CCA AAT 7-^0 
Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn 

230 235 240 

AGG GTT TTT AAA GTA AAT ACT AAT GCC TAT TAT GAA ATG AGT GGG TTA 768 
Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu 
245 250 255 
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GAA GTA AGC TTT GAG GAA CTT AGA ACA TTT GGG GGA CAT GAT GCA AAG 816 
Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys 
260 265 270 

^ TTT ATA GAT AGT TTA CAG GAA AAC GAA TTT CGT CTA TAT TAT TAT AAT 864 

Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn 
275 280 285 

AAG TTT AAA GAT ATA GCA AGT ACA CTT AAT AAA GCT AAA TCA ATA GTA 912 
10 Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val 
290 295 300 

GGT ACT ACT GCT TCA TTA CAG TAT ATG AAA AAT GTT TTT AAA GAG AAA 96 0 

Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys 
1^ .^05 310 315 320 

TAT CTC CTA TCT GAA GAT ACA TCT GGA AAA TTT TCG GTA GAT AAA TTA 1008 
Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu 
325 330 335 

AAA TTT GAT AAG TTA TAC AAA ATG TTA ACA GAG ATT TAC ACA GAG GAT 1056 
Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp 
340 345 350 

25 AAT TTT GTT AAG TTT TTT AAA GTA CTT AAC AGA AAA ACA TAT TTG AAT 1104 

Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn 
355 360 365 

TTT GAT AAA GCC GTA TTT AAG ATA AAT ATA GTA CCT AAG GTA AAT TAC 1152 
30 Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val Asn Tvr 
370 375 380 

ACA ATA TAT GAT GGA TTT AAT TTA AGA AAT ACA AAT TTA GCA GCA AAC 1200 
^_ Thr He Tyr Asp Giy Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn 

J.*^ 385 390 395 400 



40 



60 



TTT AAT GGT CAA AAT ACA GAA ATT AAT AAT ATG AAT TTT ACT AAA CTA 1248 
Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr Lvs Leu 
405 410 415 

AAA AAT TTT ACT GGA TTG TTT GAA TTT TAT AAG TTG CTA TGT GTA AGA 1296 
Lys Asn Phe Thr Gly Leu Phe Glu Phe Tvr Lys Leu Leu Cys Val Arg 
420 425 * 430 



4? GGG ATA ATA ACT TCT AAA ACT AAA TCA TTA GAT AAA GGA TAC AAT AAG 1344 

Gly He He Thr Ser Lys Thr Lys Ser Leu Asp Lys Gly Tyr Asn Lvs 
435 440 445 

GCA TTA AAT GAT TTA TGT ATC AAA GTT AAT AAT TGG GAC TTG TTT TTT 13 92 

50 Ala Leu Asn Asp Leu Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe 

450 455 460 

AGT CCT TCA GAA GAT AAT TTT ACT AAT GAT CTA AAT AAA GGA GAA GAA 14 4 0 

Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Aon Lys Gly Glu Glu 
55 465 470 475 480 

ATT ACA TCT GAT ACT AAT ATA GAA GCA GCA GAA GAA AAT ATT AGT TTA 14 88 

He Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu 
485 490 495 



GAT TTA ATA CAA CAA TAT TAT TTA ACC TTT AAT TTT GAT AAT GAA CCT 1536 
Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 
500 505 510 



6^ GAA AAT ATT TCA ATA GAA AAT CTT TCA AGT GAC ATT ATA GGC CAA TTA 1584 

Glu Asn He Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu 
515 520 525 

GAA CTT ATG CCT AAT ATA GAA AGA TTT CCT AAT GGA AAA AAG TAT GAG 1632 
70 Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 
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T , , ? * ^'^^ '^'-^ ACT AGC GAA GTA AGT ACT ACG 

Gin Leu val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 

610 SIS 



530 535 

TTA GAT AAA TAT ACT ATG TTC CAT TAT CTT CGT GCT CAA GAA TTT GAA 
Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu 

550 555 560 

CAT GGT AAA TCT AGG ATT GCT TTA ACA AAT TCT GTT AAC GAA GCA TTA 
His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 
565 570 575 

TTA AAT CCT AGT CGT GTT TAT ACA TTT TTT TCT TCA GAC TAT GTA AAG 
Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys 
5S0 585 590 

AAA GTT AAT AAA GCT ACG GAG GCA GCT ATG TTT TTA GGC TGG GTA GAA 
Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu 

600 605 

Tl": l'''^ ^''l ir °AA ACT AGC GAA GTA AGT ACT ACG 

Glu 
620 

GAT AAA ATT GCG GAT ATA ACT ATA ATT ATT CCA TAT ATA GGA CCT GCT 
Asp Lys He Ala Asp He Thr He He He Pro Tyr He Gly Pro Ala 

635 640 

TTA AAT ATA GGT AAT ATG TTA TAT AAA GAT GAT TTT GTA GGT GCT TTA 
Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Glv Ala Leu 

650 ' 655 

^TA TTT TCA GGA GCT GTT ATT CTG TTA GAA TTT ATA CCA GAG ATT GCA 
He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu He Ala 

665 670 

ATA CCT GTA TTA GGT ACT TTT GCA CTT GTA TCA TAT ATT GCG AAT AAG 
He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys 

680 685 

GTT CTA ACC GTT CAA ACA ATA GAT AAT GCT TTA AGT AAA AGA AAT GAA 
val Leu Thr Val Gin Thr He Asp Asn Ala Leu Ser ^ Arg ^n G^ 
690 695 700 

AAA TGG GAT GAG GTC TAT AAA TAT ATA GTA ACA AAT TGG TTA GCA AAG 
Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lvs 

710 715 720 

^T'^ ^^'^ AAA AAA ATG AAA GAA GCT TTA 

val Asn Thr Gin He Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu 
725 730 735 

lu. ^^^^ ^^"^ ATA AAC TAT CAG TAT AAT 

Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn 
^"^O 745 750 

CAA TAT ACT GAG GAA GAG AAA AAT AAT ATT AAT TTT AAT ATT GAT GAT 
Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He AsJ AsJ 
^55 760 765 

TTA AGT TCG AAA CTT AAT GAG TCT ATA AAT AAA GCT ATG ATT AAT ATA 
Leu ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He He 

775 780 

AAT AAA TTT TTG AAT CAA TGC TCT GTT TCA TAT TTA ATG AAT TCT ATG 
Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met 

790 795 

ATC CCT TAT GGT GTT AAA CGG TTA GAA GAT TTT GAT GCT AGT CTT AAA 
He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu ^ 
^^5 810 815 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



21i: 



2160 



2208 



2256 



2304 



2352 



2400 



2446 
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1090 1095 1100 

TTA CAA TAT GAT AAA CCA TAC TAT ATG TTA AAT TTA TAT GAT CCA AAT 3360 
Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn 
1105 1110 1115 1120 

AAA TAT GTC GAT GTA AAT AAT GTA GGT ATT AGA GOT TAT ATG TAT CTT 3408 
Lys Tyr Val Asp Val Asn Asn Val Gly He Arg Gly Tyr Met Tyr Leu 
1125 1130 1135 

AAA GGG CCT AGA GGT AGC GTA ATG ACT ACA AAC ATT TAT TTA AAT TCA 34 56 

Lys Gly Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser 
1140 1145 1150 

ACT TTG TAT AGG GGG ACA AAA TTT ATT ATA AAA AAA TAT GCT TCT GGA 3 504 

Ser Leu Tyr Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly 
1155 1160 il65 

AAT AAA GAT AAT ATT GTT AGA AAT AAT GAT CGT GTA TAT ATT AAT GTA 3552 
Asn Lys Asp Asn He Val Arg Asn Asn Asp Arg Val Tyr He Asn Val 
1170 1175 1180 

GTA GTT AAA AAT AAA GAA TAT AGG TTA GCT ACT AAT GCA TCA CAG GCA 3600 
Val Val Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala 
1185 1190 1195 1200 

GGC GTA GAA AAA ATA CTA AGT GCA TTA GAA ATA CCT GAT GTA GGA AAT 364 8 

Gly Val Glu Lys He Leu Ser Ala Leu Glu He Pro Asp Val Gly Asn 
1205 1210 1215 

CTA AGT CAA GTA GTA GTA ATG AAG TCA AAA AAT GAT CAA GGA ATA ACA 36 96 

Leu Ser Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Gly He Thr 
1220 1225 1230 

AAT AAA TGC AAA ATG AAT TTA CAA GAT AAT AAT GGG AAT GAT ATA GGC 3744 
Asn Lys Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Gly 
1235 1240 1245 

TTT ATA GGA TTT CAT CAG TTT AAT AAT ATA GCT AAA CTA GTA GCA AGT 3792 
Phe He Gly Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser 
1250 1255 1260 

AAT TGG TAT AAT AGA CAA ATA GAA AGA TCT AGT AGG ACT TTG GGT TGC 384 0 

Asn Trp Tyr Asn Arg Gin He Glu Arg Ser Ser Arg Thr Leu Gly Cys 
1265 1270 1275 1280 

TCA TGG GAA TTT ATT CCT GTA GAT GAT GGA TGG GGA GAA AGG CCA CTG 3 888 

Ser Trp Glu Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
1285 1290 1295 



TAA 

(2) INFORMATION FOR SEQ ID NO;28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



Met Gin Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Glv 
1 5 10 15 

Val Asp He Ala Tyr He Lys He Pro Asn Val Gly Gin Met Gin Pro 
20 25 30 

Val Lys Ala Phe Lys He His Asn Lys He Trp Val He Pro Glu Arg 



3891 
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35 



40 



45 



Asp Thr Phe Thr Asn Pro Glu Giu Gly Asp Leu Asn Pro Pro Pro Glu 

55 60 

Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr 

70 75 8Q 

Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu 
85 90 95 

Arg lie Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser Ide Val 

105 110 

Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu Lys 
H5 120 125 

Val lie Asp Thr Asn Cys lie Asn Val He Gin Pro Asp Gly Ser Tyr 

Arg Ser Glu Glu Leu Asn Leu Val He He Gly Pro Ser Ala Asp He 

150 155 160 

He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr 

170 175 

Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe 
180 185 190 

Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu 
155 200 205 

Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu 
-10 215 220 

Leu He His Ala Gly His Arg Leu Tyr Gly He Ala He Asn Pro Asn 
^•^5 230 235 240 

Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu 
245 250 255 

Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys 
260 265 270 

Phe He Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn 
-75 280 285 

Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala Lys Ser Il*» Val 
290 295 300 

Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys 

310 315 320 

Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu 

325 330 335 

Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp 
340 345 350 

Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn 
355 360 365 

Phe Asp Lys Ala Val Phe Lys He Asn He Val Pro Lys Val Asn Tvr 
370 375 380 

Thr He Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn 

390 395 400 

Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr Lys Leu 
405 410 415 
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Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lvs 
420 425 

Gly He He Thr Ser Lys Thr Lys Ser Leu Asp 
435 

Ala Leu Asn Asp Leu Cys He Lys Val Asn Asn 
450 455 



Ser Pro Ser Glu Asp Asn Phe Thr Asn 
465 470 



Asp Leu 
475 



Phe Asn 
Ser Asp 
Pro Asn 



He Thr Ser Asp Thr Asn He Glu Ala Ala Glu 
485 490 

Asp Leu He Gin Gin Tyr Tyr Leu Thr 
500 505 

Glu Asn He Ser He Glu Asn Leu Ser 
515 520 

Glu Leu Met Pro Asn He Glu Arq Phe 
530 535 

Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg 
545 550 555 

His Gly Lys Ser Arg He Ala Leu Thr Asn Ser 
565 570 

Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser 
580 585 

Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe 

595 600 

Gin Leu val Tyr Asp Phe Thr Asp Glu Thr Ser 
610 615 

Aop Lys He Ala Asp He Thr He He He Pro 

625 630 635 



Leu Leu Cys Val Arg 

430 

Lys Gly Tyr Asn Lys 
445 

Trp Asp Leu Phe Phe 

460 

Asn Lys Gly Glu Glu 
480 

Glu Asn He Ser Leu 
495 

Phe Asp Asn Glu Pro 
510 

He He Gly Gin Leu 
525 

Gly Lys Lys Tyr Glu 
540 

Ala Gin Glu Phe Glu 
560 

Val Asn Glu Ala Leu 
575 

Ser Asp Tyr Val Lys 
590 

Leu Gly Trp Val Glu 
605 

Glu Val Ser Thr Thr 
620 

Tyr He Gly Pro Ala 
640 



Leu Asn He Gly Asn Met Leu Tyr Lys 
645 

He Phe Ser Gly Ala Val He Leu Leu 
660 665 

He Pro Val Leu Gly Thr Phe Ala Leu 
675 680 

Val Leu Thr Val Gin Thr He Asp Asn 
690 695 

Lys Trp Asp Glu Val Tyr Lys Tyr He 
705 710 

Val Asn Thr Gin He Asp Leu He Arg 
725 

Glu Asn Gin Ala Glu Ala Thr Lys Ala 
740 745 

Gin Tyr Thr Glu Glu Glu Lys Asn Asn 
755 760 

Leu Ser Ser Lys Leu Asn Glu Ser He 
770 775 

Asn Lys Phe Leu Asn Gin Cys Ser Val 



Asp Asp Phe Val Gly Ala Leu 
650 655 

Glu Phe He Pro Glu He Ala 
670 

Val Ser Tyr He Ala Asn Lys 
685 

Ala Leu Ser Lys Arg Asn Glu 
700 

Val Thr Asn Trp Leu Ala Lys 
715 720 

Lys Lys Met Lys Glu Ala Leu 
730 735 



He He 
He Asn 
Asn Lys 
Ser Tyr Leu Met Asn Ser Met 



Asn Tyr Gin Tyr Asn 
750 

Phe Asn He Asp Asp 
765 

Ala Met He Asn He 
780 
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785 



790 



795 



800 



He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys 

BIO 815 

Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly Thr Leu He Gly 
820 825 830 

Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp 
835 840 845 

He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu Leu Ser 
8S0 855 860 

Thr Phe Thr Glu Tyr He Lys Asn He He Asn Thr Ser He Leu Asn 
865 870 875 880 

Leu Arg Tyr Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser 
885 890 895 

Lys He Asn He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn 
900 905 910 

Gin He Gin Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu 
915 920 925 

Lys Asn Ala He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser 
930 935 940 

Phe Trp He Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn 
945 950 955 960 

Glu Tyr Thr He He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val 
965 970 975 

Ser Leu Asn Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu 
980 985 990 

He Lys Gin Arg Val Val Phe Lys Tyr Ser Gin Met He Asn He Ser 
995 1000 1005 

Asp Tyr He Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu 
1010 1015 1020 

Asn Asn Ser Lys He Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro 
-025 1030 1035 1040 

He Ser Asn Leu Gly Asn He His Ala Ser Asn Asn He Met Phe Lys 
1045 1050 1055 

Leu Asp Gly Cys Arg Asp Thr His Arg Tyr He Trp He Lys Tyr Phe 
1060 1065 1070 

Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr 
1075 1080 1085 

Asp Asn Gin Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tyr 
1090 1095 1100 

Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn 
H05 1110 1115 1120 

Lys Tyr Val Asp Val Asn Asn Val Gly He Arg Gly Tvr Met Tyr Leu 
H25 1130 ' 1135 

Lys Gly Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser 
1140 1145 1150 

Ser Leu Tyr Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly 
1155 1160 1165 
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Asn Lys Asp Asn He Val Arg Asn Asn Asp Arg Val Tyr He Asn Val 
1170 1X75 1180 

Val Val Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala 
^^^^ 1190 1195 X200 

Gly Val Glu Lys He Leu Ser Ala Leu Glu He Pro Asp Val Gly Asn 
1205 1210 1215 

Leu Ser Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Gly He Thr 
1220 1225 1230 

Asn Lys Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp lie Glv 
1235 1240 1245 

Phe He Gly Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser 
1250 1255 1260 

Asn Trp Tyr Asn Arg Gin He Glu Arg Ser Ser Arg Thr Leu Gly Cys 
^265 1270 1275 1280 

Ser Trp Glu Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
1285 1290 1295 

(2) INFORMATION FOR SEQ ID NO: 29: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

CGCCATGGCT AGATTATTAT CTACATTTAC 

(2) INFORMATION FOR SEQ ID NO: 30: 

Ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GCAAGCTTCT TGACAGACTC ATGTAG 

(2) INFORMATION FOR SEQ ID NO: 31: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
* (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 
TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACCATG GGCCATCATC 1 
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15 



ATCATCATCA TCATCATCAT CACAGCAGCG GCCATATCGA AGGTCGTCAT ATGGCTAGCA 18( 

TGGCTAGATT ATTATCTACA TTTACTGAAT ATATTAAGAA TATTATTAAT ACTTCTATAT 24 ( 

TGAATTTAAG ATATGAAAGT AATCATTTAA TAGACTTATC TAGGTATGCA TCAAAAATAA 30( 

ATATTGGTAG TAAAGTAAAT TTTGATCCAA TAGATAAAAA TCAAATTCAA TTATTTAATT 36 i 

TAGAAAGTAG TAAAATTGAG GTAATTTTAA AAAATGCTAT TGTATATAAT AGTATGTATG 42( 

AAAATTTTAG TACTAGCTTT TGGATAAGAA TTCCTAAGTA TTTTAACAGT ATAAGTCTAA 4e( 

ATAATGAATA TACAATAATA AATTGTATGG AAAATAATTC AGGATGGAAA GTATCACTTA 54c 

ATTATGGTGA AATAATCTGG ACTTTACAGG ATACTCAGGA AATAAAACAA AGAGTAGTTT 60C 

TTAAATACAG TCAAATGATT AATATATCAG ATTATATAAA CAGATGGATT TTTGTAACTA 66C 

TCACTAATAA TAGATTAAAT AACTCTAAAA TTTATATAAA TGGAAGATTA ATAGATCAAA 72 C 

AACCAATTTC AAATTTAGGT AATATTCATG CTAGTAATAA TATAATGTTT AAATTAGATG 78C 

GTTGTAGAGA TACACATAGA TATATTTGGA TAAAATATTT TAATCTTTTT GATAAGGAAT 84 C 

25 TAAATGAAAA AGAAATCAAA GATTTATATG ATAATCAATC AAATTCAGGT ATTTTAAAAG 900 

ACTTTTGGGG TGATTATTTA CAATATGATA AACCATACTA TATGTTAAAT TTATATGATC 960 

^AAATAAATA TGTCGATGTA AATAATGTAG GTATTAGAGG TTATATGTAT CTTAAAGGGC 1020 

CTAGAGGTAG CGTAATGACT ACAAACATTT ATTTAAATTC AAGTTTGTAT AGGGGGACAA 1080 

AATTTATTAT AAAAAAATAT GCTTCTGGAA ATAAAGATAA TATTGTTAGA AATAATGATC 1140 

35 GTGTATATAT TAATGTAGTA GTTAAAAATA AAGAATATAG GTTAGCTACT AATGCATCAC 1200 

AGGCAGGCGT AGAAAAAATA CTAAGTGCAT TAGAAATACC TGATGTAGGA AATCTAAGTC 1260 

AAGTAGTAGT AATGAAGTCA AAAAATGATC AAGGAATAAC AAATAAATGC AAAATGAATT 1320 
TACAAGATAA TAATGGGAAT GATATAGGCT TTATAGGATT TCATCAGTTT AATAATATAG 
CTAAACTAGT AGCAAGTAAT TGGTATAATA GACAAATAGA AAGATCTAGT AGGACTTTGG 
45 GTTGCTCATG GGAATTTATT CCTGTAGATG ATGGATGGGG AGAAAGGCCA CTGTAATTAA 
TCTCAAACTA CATGAGTCTG TCAAGAAGCT TGCGGCCGCA CTCGAG 
(2) INFORMATION FOR SEQ ID NO: 32: 



50 



60 



65 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

Ui) SEQUENCE DESCRIPTION; SEQ ID NO: 32: 

Met His His His His His His Met Ala 
1 5 

(2) INFORMATION FOR SEO ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: single 

'0 (D) TOPOLOGY: linear 



1380 
1440 
1500 
1546 
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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

TATGCATCAC CATCACCATC A 

<2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA" 

«xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CATGTGATGG TGATGGTGAT GCA 

(2) INFORMATION FOR SEQ ID NO: 35: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1335 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATG CAT CAC CAT CAC CAT CAC ATG GCT CGT CTG CTG TCT ACC TTC ACT 
Met His His His His His His Met Ala Arg Leu Leu Ser Thr Phe Thr 
15 10 15 

GAA TAC ATC AAG AAC ATC ATC AAT ACC TCC ATC CTG AAC CTG CGC TAC 
Glu Tyr He Lys Asn He He Asn Thr Ser He Leu Asn Leu Arg Tyr 
20 25 30 

GAA TCC AAT CAC CTG ATC GAC CTG TCT CGC TAC GCT TCC AAA ATC AAC 1 
Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser Lys He Asn 
35 40 45 

ATC GGT TCT AAA GTT AAC TTC GAT CCG ATC GAC AAG AAT CAG ATC CAG 1 
He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He Gin 
50 55 60 

CTG TTC AAT CTG GAA TCT TCC AAA ATC GAA GTT ATC CTG AAG AAT GCT 2 
Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu Lys Asn Ala 
€5 70 75 80 

ATC GTA TAC AAC TCT ATG TAC GAA AAC TTC TCC ACC TCC TTC TGG ATC 2 
He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp He 
85 90 95 

CGT ATC CCG AAA TAC TTC AAC TCC ATC TCT CTG AAC AAT GAA TAC ACC 3 
Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tyr Thr 
100 105 110 

ATC ATC AAC TGC ATG GAA AAC AAT TCT GGT TGG AAA GTA TCT CTG AAC 3 
He He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu Asn 
115 120 125 
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?w n?^ ^T^ ^'^^ '^^^ ^^'^ ^'^^ GAC ACT CAG GAA ATC AAA CAG 

Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu He Lys Gin 

135 140 

Arl S'^T IV^ ™^ ''^'^ ^'^^ A'T^ AAC ATC TCT GAC TAC ATC 

Aig Val val Phe Lys Tyr Ser Gin Met He Asn lie Ser Asp Tyr lie 

^50 

AAT CGC TGG ATC TTC GTT ACC ATC ACC AAC AAT CGT CTG AAT AAC TCC 
Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu Asn Asn Ser 

170 175 

AAA ATC TAC ATC AAC GGC CGT CTG ATC GAC CAG AAA CCG ATC TCC AAT 
Lys He Tyr He Asn Gly Arg Leu lie Asp Gin Lvs Pro He Ser Asn 

185 190 

CTG GGT AAC ATC CAC GCT TCT AAT AAC ATC ATG TTC AAA CTG GAC GGT 
Leu Gly Asn He His Ala ser Asn Asn He Met Phe Lys Leu Asp Gly 
195 200 205 

TGT CGT GAC ACT CAC CGC TAC ATC TGG ATC AAA TAC TTC AAT CTG TTC 
Cys Arg Asp Thr His Arg Tyr He Trp He Lys Tyr Phe Asn Leu Phe 

215 220 

GAC AAA GAA CTG AAC GAA AAA GAA ATC AAA GAC CTG TAC GAC AAC CAG 
Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr Asp Asn Gin 

230 235 240 

TCC AAT TCT GGT ATC CTG AAA GAC TTC TGG GGT GAC TAC CTG CAG TAC 
Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin Tyr 
245 250 255 

GAC AAA CCG TAC TAC ATG CTG AAT CTG TAC GAT CCG AAC AAA TAC GTT 
Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr Val 
260 265 270 

GAC GTC AAC AAT GTA GGT ATC CGC GGT TAC ATG TAC CTG AAA GGT CCG 
Asp Val Asn Asn Val Gly He Arg Gly Tyr Met Tvr Leu Lys Gly Pro 
^"^5 280 * 285 

CGT GGT TCT GTT ATG ACT ACC AAC ATC TAC CTG AAC TCT TCC CTG TAC 
Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser Ser Leu Tyr 
-90 295 300 

CGT GGT ACC AAA TTC ATC ATC AAG AAA TAC GCG TCT GGT AAC AAG GAC 
Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly Asn Lys Asp 

3iO 315 320 

AAT ATC GTT CGC AAC AAT GAT CGT GTA TAC ATC AAT GTT GTA GTT AAG 
Asn lie Val Arg Asn Asn Asp Arg Val Tyr He Asn Val Val Val Lys 
325 330 335 

AAC AAA GAA TAC CGT CTG GCT ACC AAT GCT TCT CAG GCT GGT GTA GAA 
Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala Gly Val Glu 
340 345 35Q 

AAG ATC TTG TCT GCT CTG GAA ATC CCG GAC GTT GGT AAT CTG TCT CAG 
Lys He Leu Ser Ala Leu Glu He Pro Asp Val Gly Asn Leu Ser Gin 
355 360 365 

GTA GTT GTA ATG AAA TCC AAG AAC GAC CAG GGT ATC ACT AAC AAA TGC 
Val Val Val Met Lys Ser Lys Asn Asp Gin Gly He Thr Asn Lys Cvs 

370 375 380 

AAA ATG AAT CTG CAG GAC AAC AAT GGT AAC GAT ATC GGT TTC ATC GGT 
Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Gly Phe He Gly 

390 395 400 

TTC CAC CAG TTC AAC AAT ATC GCT AAA CTG GTT GCT TCC AAC TGG TAC 
Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser Asn Trp Tyr 

- 280 . 



432 



480 



528 



576 



624 



672 



720 



766 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 
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405 410 415 

AAT CGT CAG ATC GAA CGT TCC TCT CGC ACT CTG GGT TGC TCT TGG GAG 12 96 

Asn Arg Gin lie Glu Arg Ser Ser Arg Thr Leu Gly Cys Ser Trp Glu 
420 425 430 

TTC ATC CCG GTT GAT GAC GGT TGG GGT GAA CGT CCG CTG TAACCCGGGA 134 5 

Phe lie Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
435 440 445 

AAGCTT 

(2) INFORMATION FOR SEQ ID NO: 36: 

ii) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 445 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met His His His His His His Met Ala Arg Leu Leu Ser Thr Phe Thr 
^5 10 15 

Glu Tyr lie Lys Asn lie He Asn Thr Ser He Leu Asn Leu Arg Tyr 
20 25 30 

Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser Lys He Asn 

35 40 45 

He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He Gin 
50 55 60 

Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu Lys Asn Ala 

65 70 75 80 

He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp He 
85 90 95 

Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tyr Thr 

100 105 110 

He He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu Asn 
115 120 125 

Tyr Giy Glu He He Trp Thr Leu Gin Asp Thr Gin Glu He Lys Gin 

130 135 140 

Arg Val Val Phe Lys Tyr Ser Gin Met He Asn He Ser Asp Tyr He 
145 150 155 160 

Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu Aon Asn Ser 
165 170 175 

Lys He Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro He Ser Asn 
180 185 190 

Leu Gly Asn He His Ala Ser Asn Asn He Met Phe Lys Leu Asp Gly 
195 200 205 

Cys Arg Asp Thr His Arg Tyr He Trp He Lys Tyr Phe Asn Leu Phe 
210 215 220 

Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr Asp Asn Gin 
225 230 235 240 

Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin Tyr 
245 250 255 
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Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr Val 
260 265 270 

Asp Val Asn Asn Val Gly He Arg Gly Tyr Met Tyr Leu Lys Gly Pro 
275 280 285 

Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser Ser Leu Tyr 
290 295 300 

Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Gly Asn Lys Asp 

310 315 320 

Asn IlG Val Arg Asn Asn Asp Arg Val Tyr He Asn Val Val Val Lys 
325 330 335 

Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala Gly Val Glu 
340 345 350 

Lys He Leu Ser Ala Leu Glu He Pro Asp Val Gly Asn Leu Ser Gin 
355 360 365 

Val Val Val Met Lys Ser Lys Asn Asp Gin Gly He Thr Asn Lys Cvs 
370 375 380 

2.^ Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Gly Phe He Glv 

390 395 400 

Phe liis Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser Asn Trp Tyr 
30 410 415 

Asn Arg Gin He Glu Arg Ser Ser Arg Thr Leu Gly Cys Scr Trp Glu 
420 425 430 

Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
-'^ 435 440 445 



40 



70 



{2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

50 CGCATATGAA TATTCGTCCA TTGCATG 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
* (A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

65 GGAAGCTTGC AGGGCAATTA CATCATG 

(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3876 base pairs 



27 



27 
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(B) TYPE: nucleic acid 

(C) STRAJTOEDNESS : double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3873 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ATG CCA GTT ACA ATA AAT AAT TTT AAT TAT AAT GAT CCT ATT GAT AAT 
Met Pro Val Thr IXe Asn Asn Phe Asn Tyr Asn Asp Pro He Asp Asn 
^5 10 15 

GAC AAT ATT ATT ATG ATG GAA CCT CCA TTT GCA AGG GGT ACG GGG AGA 
Asp Asn He He Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg 
20 25 30 

TAT TAT AAA GCT TTT AAA ATC ACA GAT CGT ATT TGG ATA ATA CCC GAA 
Tyr Tyr Lys Ala Phe Lys He Thr Asp Arg He Trp He He Pro Glu 
35 40 45 

AGA TAT ACT TTT GGA TAT AAA CCT GAG GAT TTT AAT AAA AGT TCC GGT 
Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 
50 55 60 

ATT TTT AAT AGA GAT GTT TGT GAA TAT TAT GAT CCA GAT TAC TTA AAT 
He Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 
65 70 75 80 

ACC AAT GAT AAA AAG AAT ATA TTT TTC CAA ACA TTG ATC AAG TTA TTT 
Thr Asn Asp Lyo Lys Asn He Phe Phe Gin Thr Leu He Lys Leu Phe 
85 90 95 

AAT AGA ATC AAA TCA AAA CCA TTG GGT GAA AAG TTA TTA GAG ATG ATT 
Asn Arg He Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met He 
100 105 110 

ATA AAT GGT ATA CCT TAT CTT GGA GAT AGA CGT GTT CCA CTC GAA GAG 
He Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 
H5 120 125 

TTT AAC ACA AAC ATT GCT AGT GTA ACT GTT AAT AAA TTA ATT AGT AAT 
Phe Asn Thr Asn He Ala Ser Val Thr Val Asn Lys Leu He Ser Asn 
130 135 140 

CCA GGA GAA GTG GAG CGA AAA AAA GGT ATT TTC GCA AAT TTA ATA ATA 
Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn Leu He He 
145 150 155 160 

TTT GGA CCT GGG CCA GTT TTA AAT GAA AAT GAG ACT ATA GAT ATA GGT 
Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr He Asp He Gly 
165 170 175 

ATA CAA AAT CAT TTT GCA TCA AGG GAA GGC TTT GGG GGT ATA ATG CAA 
He Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly He Met Gin 
180 185 190 

ATG AAA TTT TGT CCA GAA TAT GTA AGC GTA TTT AAT AAT GTT CAA GAA 
Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
195 200 205 

AAC AAA GGC GCA AGT ATA TTT AAT AGA CGT GGA TAT TTT TCA GAT CCA 
Asn Lys Gly Ala Ser He Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 
210 215 220 

GCC TTG ATA TTA ATG CAT GAA CTT ATA CAT GTT TTG CAT GGA TTA TAT 
Ala Leu He Leu Met His Glu Leu He His Val Leu His Gly Leu Tyr 
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225 230 



235 240 



10 



15 



30 



40 



45 



50 



300 



70 



490 495 

TTA ATA AGT AAA ATA GAA TTA CCA AGT GAA AAT ACA GAA TCA CTT ACT 

Leu lie Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 
500 505 510 



816 



GGC ATT AAA GTA GAT GAT TTA CCA ATT GTA CCA AAT GAA AAA AAA TTT 768 
Gly He Lys Val Asp Asp Leu Pro He Val Pro Asn Glu Lys Lvs Phe 
245 250 255 

TTT ATG CAA TCT ACA GAT ACT ATA GAG GCA GAA GAA CTA TAT ACA TTT 
Phe Met Gin Ser Thr Asp Thr He Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

GGA GGA CAA GAT CCC AGC ATC ATA TCT CCT TCT ACA GAT AAA AGT ATC 664 
Gly Gly Gin Asp Pro Ser He He Ser Pro Ser Thr Asp Lys aer He 
275 280 285 

TAT GAT AAA GTT TTG CAA AAT TTT AGG GGG ATA GTT GAT AGA CTT AAC 
Tyr Asp Lys Val Leu Gin Asn Phe Arg Gly He Val Asp Arg Leu Asn 
290 295 



912 



1008 



1056 



1104 



AAG GTT TTA GTT TGC ATA TCA GAT CCT AAC ATT AAC ATT AAT ATA TAT 960 
-0 Lys val Leu Val Cys He Ser Asp Pro Asn He Asn He Asn lie Tvr 

310 315 320 

AAA AAT AAA TTT AAA GAT AAA TAT AAA TTC GTT GAA GAT TCT GAA GGA 
Lys Asn Lys Phe Lyss Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly 
325 330 335 

AAA TAT AGT ATA GAT GTA GAA AGT TTC AAT AAA TTA TAT AAA AGC TTA 
Lys Tyr Ser He Asp Val Glu Ser Phe Aon Lvs Leu Tyr Lys Ser Leu 
340 345 ' 350 

ATG TTA GGT TTT ACA GAA ATT AAT ATA GCA GAA AAT TAT AAA ATA AAA 
Met Leu Gly Phe Thr Glu He Asn He Ala Glu Asn Tyr Lys He Lys 
355 360 365 

ACT AGA GCT TCT TAT TTT AGT GAT TCC TTA CCA CCA GTA AAA ATA AAA 1152 
Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys He Lvs 
370 375 380 

AAT TTA TTA GAT AAT GAA ATC TAT ACT ATA GAG GAA GGG TTT AAT ATA 1200 
Asn Leu Leu Asp Asn Glu He Tyr Thr He Glu Glu Gly Phe Asn He 
385 390 395 400 

TCT GAT AAA AAT ATG GGA AAA GAA TAT AGG GGT CAG AAT AAA GCT ATA 124 8 

Ser Asp Lys Asn Met Gly Lys Glu Tyr Arg Gly Gin Asn Lys Ala He 
405 410 415 

AAT AAA CAA GCT TAT GAA GAA ATC AGC AAG GAG CAT TTG GCT GTA TAT 1296 
Asn Lys Gin Ala Tyr Glu Glu He Ser Lys Glu His Leu Ala Val Tyr 
420 425 430 

AAG ATA CAA ATG TGT AAA AGT GTT AAA GTT CCA GGA ATA TGT ATT GAT 1344 
Lys He Gin Met Cys Lys Ser Val Lys Val Pro Gly He Cys He Asp 
435 440 445 

GTC GAT AAT GAA AAT TTG TTC TTT ATA GOT GAT AAA AAT AGT TTT TCA 1392 
Val Asp Asn Glu Asn Leu Phe Phe He Ala Asp Lys Asn Ser Phe Ser 
450 455 460 

GAT GAT TTA TCT AAA AAT GAA AGA GTA GAA TAT AAT ACA CAG AAT AAT 144 0 

OU Asp Asp Leu Ser Lys Asn Glu Arg Val Glu Tyr Asn Thr Gin Asn Asn 
465 • 470 475 480 

TAT ATA GGA AAT GAC TTT CCT ATA AAT GAA TTA ATT TTA GAT ACT GAT 148 8 

Tyr He Gly Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Asd 
485 ^ 



1536 
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GAT TTT AAT GTA GAT GTT CCA GTA TAT GAA AAA CAA CCC GCT ATA AAA 1584 
Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gin Pro Ala He Lys 
515 520 525 

AAA GTT TTT ACA GAT GAA AAT ACC ATC TTT CAA TAT TTA TAC TCT CAG 1632 
Lys Val Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tyr Ser Gin 
530 535 540 

ACA TTT CCT CTA AAT ATA AG A GAT ATA AGT TTA ACA TCT TCA TTT GAT 1680 
Thr Phe Pro Leu Asn He Arg Asp He Ser Leu Thr Ser Ser Phe Asp 
545 550 555 560 

GAT GCA TTA TTA GTT TCT AGC AAA GTT TAT TCA TTT TTT TCT ATG GAT 1728 
Asp Ala Leu Leu Val Ser Ser Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 575 

TAT ATT AAA ACT GCT AAT AAA GTA GTA GAA GCA GGA TTA TTT GCA GGT 1776 
Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly 
580 585 590 

TGG GTG AAA CAG ATA GTA GAT GAT TTT GTA ATC GAA GCT AAT AAA AGC 1824 
Trp Val Lys Gin He Val Asp Asp Phe Val He Glu Ala Asn Lys Ser 
595 600 605 

AGT ACT ATG GAT AAA ATT GCA GAT ATA TCT CTA ATT GTT CCT TAT ATA 1872 
Ser Thr Met Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr He 
610 615 620 

GGA TTA GCT TTA AAT GTA GGA GAT GAA ACA GCT AAA GGA AAT TTT GAA 1920 
Gly Leu Ala Leu Asn Val Gly Asp Glu Thr Ala Lys Gly Asn Phe Glu 
625 630 635 640 

AGT GCT TTT GAG ATT GCA GGA TCC AGT ATT TTA CTA GAA TTT ATA CCA 1968 
Ser Ala Phe Glu He Ala Gly Ser Ser He Leu Leu Glu Phe He Pro 
645 650 655 

GAA CTT TTA ATA CCT GTA GTT GGA GTC TTT TTA TTA GAA TCA TAT ATT 2016 
Glu Leu Leu He Pro Val Val Gly Val Phe Leu Leu Glu Ser Tyr He 
660 665 670 

GAC AAT AAA AAT AAA ATT ATT AAA ACA ATA GAT AAT GCT TTA ACT AAA 2064 
Asp Asn Lys Asn Lys He He Lys Thr He Asp Asn Ala Leu Thr Lys 
675 680 685 

AGA GTG GAA AAA TGG ATT GAT ATG TAC GGA TTA ATA GTA GCG CAA TGG 2112 
Arg Val Glu Lys Trp He Asp Met Tyr Gly Leu He Val Ala Gin Trp 

690 695 700 

CTC TCA ACA GTT AAT ACT CAA TTT TAT ACA ATA AAA GAG GGA ATG TAT 2160 
Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He Lys Glu Gly Met Tyr 
705 710 715 720 

AAG GCT TTA AAT TAT CAA GCA CAA GCA TTG GAA GAA ATA ATA AAA TAC 2208 
Lys Ala Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu He He Lys Tyr 
725 730 735 

AAA TAT AAT ATA TAT TCT GAA GAG GAA AAG TCA AAT ATT AAC ATC AAT 2256 
Lys Tyr Asn He Tyr Ser Glu Glu Glu Lys Ser Asn He Asn He Asn 
740 745 750 

TTT AAT GAT ATA AAT TCT AAA CTT AAT GAT GGT ATT AAC CAA GCT ATG 2 3 04 

Phe Asn Asp He Asn Ser Lys Leu Asn Asp Gly He Asn Gin Ala Met 
755 760 765 

GAT AAT ATA AAT GAT TTT ATA AAT GAA TGT TCT GTA TCA TAT TTA ATG 23 52 

Asp Asn He Asn Asp Phe He Asn Glu Cys Ser Val Ser Tyr Leu Met 
770 775 780 

AAA AAA ATG ATT CCA TTA GCT GTA AAA AAA TTA CTA GAC TTT GAT AAT 24 00 

Lys Lys Met He Pro Leu Ala Val Lys Lys Leu Leu Asp Phe Asp Asn 
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■'90 795 800 

Thr ^lu ^■^'^ CAT GAA AAT AAA TTA TAT 

S '-y^ '-y*' "^V"^ Asp Glu Asn Lys Leu Tyr 

810 815 

TTA ATT GGA AGT GTA GAA GAT GAA AAA TCA AAA GTA GAT AAA TAG TTG 
Leu He Gly Ser Val Glu Asp Glu Lys Ser Lys Val Asp Lys Tyr Leu 
10 "° "25 830 

SJJ ?Sr iTI p^^ f*"" ""^^ ^^"^ '^TT GAA ATA 

Lys Thr lie He Pro Phe Asp Leu Ser Thr Tyr Ser Asn He Qlu He 
835 840 845 

Su lie ^ lit IT ^"^ '^G^ °AA ATT TTA AAT AAT ATT 

Leu lie Lys He Phe Asn Lys Tyr Asn Ser Glu He Leu Asn Asn He 

855 860 

->0 iTf 7'^^ "^^"^ '^A'^ TTA ATA GAT TTA TCA GGA 

' lit " ^^"3 "^y"" ^''S '^^P '^^^ Leu He Asp Leu Ser Glv 

870 875 880 

TAT GGA GCA AAG GTA GAG GTA TAT GAT GGG GTC AAG CTT AAT GAT AAA 
Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn A=p ^ 
885 890 995 



.10 



40 



45 



50 



60 



65 



900 



915 



930 



945 950 



96b 



980 



995 



1010 



1025 • 1030 



1045 



2448 



2496 



2544 



2592 



2640 



2688 



70 



' TCA 
Ser 


GCA 
Ala 
905 


GAT 
Asp 


AGT 
Ser 


AAG 
Lys 


ATT 
He 


AGA 
Arg 
910 


Val 


APT 

Thr 


2736 


' AAT 
Asn 
920 


AGT 
Ser 


ATG 
Met 


TTC 
Phe 


CTT GAT 
Leu Acp 
925 


TTT 
Phe 


AGC 
Ser 


Oil 

Val 




AAA 
Lys 


TAT 
Tyr 


AGG 
Arg 


AAT 
Asn 


GAT 
Asp 
94 0 


GAT 
Asp 


ATA 

He 


CAA 
Gin 


AAT 
Asn 


2632 


ATA 

He 


ATT 
He 


AAT 
Asn 


TGT 
Cys 
955 


ATG 
Met 


AAA 
Lys 


AAT 
Asn 


AAT 
Asn 


TCA 
Ser 
960 


2880 


GGT AAT 
Gly Asn 


AGG 
Arg 
970 


ATA 
He 


ATA 

He 


TGG ACC 
Trp Thr 


TTA 
Leu 
975 


ATT 

He 


2928 


TCA 
Ser 


GTA 
Val 
985 


TTT 
Phe 


TTT 
Phe 


GAA 
Glu 


TAT 

Tyr 


AAC 
Asn 
990 


ATA 
He 


AGA 
Arg 


2976 


AAT AGA 
Asn Arg 
1000 


TGG 
Trp 


TTT 
Phe 


TTT 
Phe 


GTA ACT 
Val Thr 

1005 


ATT 
He 


ACT 
Thr 


3024 


ATT 
lie 


TAT 
Tyr 


ATT 
He 


AAT 
Asn 


GGC ACG 
Gly Thr 
1020 


TTA 
Leu 


GAA 
Glu 


TCA 
Ser 


3072 


GGA 
Gly 


GAA 
Glu 


GTT 
Val 


ATT 
He 
1035 


GTT 
Val 


AAT 
Asn 


GGT 
Gly 


GAA 
Glu 


ATA 
He 
1040 


3120 


GTA 
Val 


GAT 
Asp 


AGA 
Arg 
1050 


ACA 
Thr 


CAA 
Gin 


TTT 
Phe 


ATT 
He 


TGG 
Trp 
1055 


ATG 
Met 


3168 


ACG 
Thr 


CAA 
Gin 
1065 


TTA 
Leu 


AAT 
Asn 


CAA 
Gin 


TCA 
Ser 


AAT 
Asn 
1070 


ATT 
He 


AAA 
Lys 


3216 
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35 



40 



45 



60 



65 



Glu Z"^ "^^^ "^^"^ TAC TTA AAA GAT TTT TGG 

T^^.;^^ ^^"^ ^y"^ Tyr Leu Lys Asp Phe Trp 

1080 1085 

PrI M*^? I*"^ ■^'^T "TAT ATG TTT AAT GCG GGG 

^ to9o "^'^ ^y"" "^"^ '^^^ "^y^ "fy^ Phe Asn Ala Gly 

1095 1100 



If3 ^'^ Sf* "^^"^ ^^"^ '^■r* ATA AAA GAA TAT GAT GAA CAG 

Ser Aon Glu Phe Tyr Lys Thr lie Glu He Lvs Glu Tyr Asp Glu Gin 
1205 1210 1215 

CCA TCA TAT AGT TGT CAG TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT 
Pro Ser Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 
1220 1225 1230 

GAT GAT ATA GGA TTG ATT GGT ATT CAT CGT TTC TAC GAA TCT GGA GTT 
Asp Asp no Gly Leu He Gly He His Arg Phe Tvr Glu Ser Gly Val 
1235 1240 1245 

TTA CGT A/A AAG TAT AAA GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA 
Leu Arg Lys Lyo Tyr Lyo Asp Tyr Phe Cys He Ser Lys Trp Tvr Leu 
■'■'•^0 1255 1260 



CAG TTT ATT CCT AAA GAT GAA GGG TGG ACT GAA TAA 
Gin Phe He Pro Lys Asp Glu Gly Trp Thr Glu 
1265 1290 

(2) INFORMATION FOR SEQ ID NO; 40: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 40: 

Met Pro Val Thr He Asn Asn Phe Asn Tyr Asn Asp Pro He Asp Asn 



3264 



33i: 



AAT AAA AAT TCA TAT ATT AAA CTA GTG AAA GAT TCA TCT GTA GGT GAA :ncn 
10 Asn^Lys Asn Ser Tyr He^Lys Leu Val Lys Asp^Se? s"r Vai gJJ 

ATA TTA ATA CGT AGC AAA TAT AAT CAG AAT TCC AAT TAT ATA AAT TAT 
He Leu He Arg Ser Lys Tyr Asn Gin Aon Ser Asn Tyr He Tyr 
1^25 1130 1135 

AGA AAT TTA TAT ATT GGA GAA AAA TTT ATT ATA AGA AGA GAG TCA AAT 
Arg Asn Leu Tyr He Gly Glu Lys Phe He lie Arg Arg Glu Ser ^n 
20 * ° 11^5 1150 

TCT CAA TCT ATA AAT GAT GAT ATA GTT AGA AAA GAA GAT TAT ATA CAT 
ser Gin Ser He Asn Asp Asp He Val Arg Lys Glu Asp Tyr lie His 
1155 1160 1165 

25 CTA GAT TTG GTA CTT CAC CAT GAA GAG TGG AGA GTA TAT GCC TAT AAA 

Leu Asp Leu Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr L^ 
^^fO 1175 1180 

TAT TTT AAG GAA CAG GAA GAA AAA TTG TTT TTA TCT ATT ATA Afi-^ raT -icnn 
Tyr Phe Lys Glu Gin Glu Glu Lys Lou Phe leu sfr l7e lie Ser AsJ 
^^^^ 1190 1195 1200 



3406 



3456 



3504 



3552 



3646 



3696 



3744 



3792 



SO ^ Sit ^ ^^"^ ^ ^""^ '^'^G TGT AAT TGG 3840 

Lys Glu 7al LyG Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp 

^^^^ 1270 1275 1280 



3876 



70 ^ ^ 10 15 
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Asp Asn lie lie Met 

20 



Met 



Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg 

25 30 



Tyr Tyr Lys Ala Phe Lys He Thr Asp Arg He Trp He He Pro Glu 
35 40 45 

Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 
50 55 60 

He Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 
65 70 75 80 

Thr Asn Asp Lys Lys Asn He Phe Phe Gin Thr Leu He Lys Leu Phe 

85 90 95 

Asn Arg He Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met He 
100 105 110 

He Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 
115 120 125 

Phe Asn Thr Asn He Ala Ser Val Thr Val Asn Lys Leu He Ser Asn 
130 135 140 

Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn Leu He He 
^'^^ 150 155 160 

Phe Gly Pro Gly Pro Val Leu Aon Glu Asn Glu Thr He Asp He Gly 
165 170 175 

He Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly He Met Gin 
180 185 190 

Met Lyo Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
195 200 205 

Asn Lys Gly Ala Ser He Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 
210 215 220 

Ala Leu He Leu Met His Glu Leu He His Val Leu His Gly Leu Tyr 
225 230 235 240 

Gly He Lys Val Asp Asp Leu Pro He Val Pro Asn Glu Lys Lys Phe 
245 250 255 

Phe Met Gin Ser Thr Asp Thr He Gin Ala Glu Glu Leu Tvr Thr Phe 
260 265 270 

Gly Gly Gin Asp Pro Ser He He Ser Pro Ser Thr Asp Lys Ser He 
275 280 285 

Tyr Asp Lys Val Leu Gin Asn Phe Arg Gly He Val Asp Arg Leu Asn 

290 295 300 

Lys Val Leu val Cys He Ser Asp Pro Asn He Asn He Asn He Tyr 
305 310 315 320 

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly 
325 330 335 

Lys Tyr Ser He Asp Val Glu Ser Phe Asn Lys Leu Tyr Lys Ser Leu 
340 345 350 

Met Leu Gly Phe Thr Glu He Asn He Ala Glu Asn Tyr Lys He Lys 
355 360 365 

Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys He Lys 
370 375 380 

Asn Leu Leu Asp Asn Glu He Tyr Thr He Glu Glu Gly Phe Asn He 
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395 400 



Ser Asp Lys Asn Met Gly Lys Glu Tyr Arg Gly Gin Asn Lys Ala He 
405 410 415 

Asn Lys Gin Ala Tyr Glu Glu He Ser Lvs Glu His Leu Ala Val Tyr 
420 425 * 430 

Lys He Gin Met Cys Lys Ser Val Lys Val Pro Gly He Cys He Asp 
435 440 

Val Asp Asn Glu Asn Leu Phe Phe He Ala Asp Lys Asn Ser Phe Ser 
450 455 460 

Asp Asp Leu Ser Lys Asn Glu Arg Val Glu Tyr Aan Thr Gin Asn Asn 

470 475 480 

Tyr He Gly Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Asn 
485 490 495 ^ 

Leu He Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 
500 505 510 

Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gin Pro Ala He Lys 
5-^5 520 525 

Lys Val Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tvr Ser Gin 

535 540 

Thr Phe Pro Leu Asn He Arg Asp He Ser Leu Thr Sor Ser Phe Asd 
^"^^ 550 555 560 

Asp Ala Leu Leu Val Ser Ser Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 575 

Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Glv 
580 585 590 

Trp val Lys Gin He Val Asp Asp Phe Val He Glu Ala Asn Lys Ser 
595 600 605 

Ser Thr Met Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr He 

^^0 615 620 

Gly Leu Ala Leu Asn Val Gly Asp Glu Thr Ala Lys Gly Asn Phe Glu 

630 635 640 

Ser Ala Phe Glu He Ala Gly Ser Ser He Leu Leu Glu Phe He Pro 

645 650 655 

Glu Leu Leu He Pro Val Val Gly Val Phe Leu Leu Glu Ser Tyr He 

665 670 

Asp Asn Lys Asn Lys He He Lys Thr He Asp Asn Ala Leu Thr Lys 
675 680 685 

Arg Val Glu Lys Trp He Asp Met Tyr Gly Leu He Val Ala Gin Trn 
690 695 700 

Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He Lys Glu Gly Met Tvr 
705 710 715 720 

Lys Ala Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu He He Lvs Tyr 
725 730 735 

Lys Tyr Asn He Tyr Ser Glu Glu Glu Lys Ser Asn He Asn He Asn 
740 745 750 

Phe Asn Asp He Asn Ser Lys Leu Asn Asp Gly He Asn Gin Ala Met 
755 760 765 



- 289 - 



wo 98/08540 



PCT/US97/15394 



Asp Asn lie Asn Asp Phe He Asn Glu Cys Ser Val Ser Tyr Leu Met 

770 775 780 

Lys Lys Met He Pro Leu Ala Val Lys Lys Leu Leu Asp Phe Asp Asn 

790 795 800 

Thr Leu Lys Lys Asn Leu Leu Asn Tyr He Asp Glu Asn Lys Leu Tyr 

805 810 815 

Leu He Gly Ser Val Glu Asp Glu Lvs Ser Lys Val Asp Lvs Tyr Leu 
820 825 830 

Lys Thr He He Pro Phe Asp Leu Ser Thr Tyr Ser Asn He Glu He 
835 840 845 

Leu He Lys He Phe Asn Lys Tyr Asn Ser Glu He Leu Asn Asn He 
850 855 860 

He Leu Asn Leu Arg Tyr Arg Asp Asn Asn Leu He Asp Leu Ser Glv 
865 870 875 880 

Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys 
885 890 895 

Asn Gin Phe Lys Leu Thr Ser Sor Ala Asp Ser Lys He Arg Val Thr 
900 905 910 

Gin Asn Gin Asn He He Phe Asn Ser Met Phe Leu Asp Phe Ser Va] 
915 920 925 

Ser Phe Trp He Arg He Pro Lys Tyr Arg Aon Asp Asp He Gin Asn 
930 935 940 

Tyr He His Asn Glu Tyr Thr He He Asn Cys Met Lys Asn Asn Ser 
^"^^ 950 955 960 

Gly Trp Lys He Ser He Arg Gly Asn Arg He He Trp Thr Leu He 
965 970 975 

Asp He Asn Gly Lyo Thr Lyo Ser Val Phe Phe Glu Tyr Asn He Arg 
980 985 990 

Glu Asp He Ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr 
995 1000 1005 

Ann Asn Leu Asp Asn Ala Lys He Tyr He Asn Glv Thr Leu Glu Ser 
1010 1015 ' 1020 

Asn Met Asp He Lys Asp He Gly Glu Val He Val Asn Glv Glu He 
1025 1030 1035 ' 1040 

Thr Phe Lys Leu Asp Gly Asp Val Asp Arg Thr Gin Phe He Trp Met 
1045 1050 1055 

Lyo Tyr Phe Ser He Phe 7^sn Thr Gin Leu Asn Gin Ser Asn He Lys 
1060 1065 1070 

Glu He Tyr Lys He Gin Ser Tvr Ser Glu Tyr Leu Lys Asp Phe Trp 
1075 1080 1085 

Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly 
1090 1095 1100 

Asn Lys Asn Ser Tyr He Lys Leu Val Lys Asp Ser Ser Val Gly Glu 
1105 1110 1115 1120 

He Leu He Arg Ser Lys Tyr Asn Gin Asn Ser Asn Tyr He Asn Tyr 
1125 1130 1135 

Arg Asn Leu Tyr He Gly Glu Lys Phe He He Arg Arg Glu Ser Asn 
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1140 1145 1150 

Ser Gin Ser He Asn Asp Asp He Val Arg Lys Glu Asp Tyr He His 
1155 1160 1165 

Leu Asp Leu Val Leu His His Glu Glu Trp Arg Vai Tyr Ala Tyr Lys 
1170 1175 1180 

Tyr Phe Lys- Glu Gin Glu Glu Lys Leu Phe Leu Ser He He Ser Asp 
1185 1190 1195 1200 

Ser Asn Glu Phe Tyr Lys Thr He Glu He Lys Glu Tvr Asp Glu Gin 
1205 1210 ' 1215 

Pro Ser Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 
1220 1225 1230 

Asp Asp He Gly Leu He Gly He His Arg Phe Tyr Glu Ser Gly Val 
1235 1240 1245 

Leu Arg Lys Lys Tyr Lys Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu 
1250 1255 1260 

Lys Glu Val Lys Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cvs Asn Trp 
^265 1270 1275 ' 1280 

Gin Phe lie Pro Lys Asp Glu Gly Trp Thr Glu 
1285 1290 

(2) INFORMATION FOR SEQ ID N0:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 6 base pairs 

(B) TYPE: nucleic acid" 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 

(IX) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1, .38 73 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG CCA GTT ACA ATA AAT AAT TTT AAT TAT AAT GAT CCT ATT GAT AAT 4 
Met Pro Val Thr He Asn Asn Phe Asn Tyr Asn Asp Pro He Asp Asn 
^5 10 15 

AAT AAT ATT ATT ATG ATG GAG CCT CCA TTT GCG AGA GOT ACG GGG AGA 9i 
Asn Asn He He Met Met Glu Pro Pro Phe Ala Arg Gly Thr Glv Arc 
20 25 30 

TAT TAT AAA GCT TTT AAA ATC ACA GAT CGT ATT TGG ATA ATA CCG GAA 14 
Tyr Tyr Lys Ala Phe Lys He Thr Asp Arg He Trp He He Pro Glu 
35 40 45 

AGA TAT ACT TTT GGA TAT AAA CCT GAG GAT TTT AAT AAA AGT TCC GGT 19 
Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Glv 
50 55 60 

ATT TTT AAT AGA GAT GTT TGT GAA TAT TAT GAT CCA GAT TAC TTA AAT 24 
He Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 
65 70 75 80 

ACT AAT GAT AAA AAG AAT ATA TTT TTA CAA ACA ATG ATC AAG TTA TTT 2 8 

Thr Asn Asp Lys Lys Asn He Phe Leu Gin Thr Met He Lys Leu Phe 
85 90 95 

AAT AGA ATC AAA TCA AAA CCA TTG GGT GAA AAG TTA TTA GAG ATG ATT 3 3 
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20 



Asn Arg He Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met He 

100 ^j^o 

S tT- "^^"^ AGA CGT GTT CCA CTC GAA GAG 

lie Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 

115 120 125 

lli ^'^'^ '^^'^ °TT AAT AAA TTA ATC AGT AAT 

10 ^^"^ '^^^ ^^"^ val Asn Lys Leu He Ser Asn 

135 X40 

CCA GGA GAA GTG GAG CGA AAA AAA GGT ATT TTC GCA AAT TTA ATA ATA 
Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn lIu lie lie 
15 155 

r?v ^?'' n'''' P^^T ^'^ ATA GAT ATA GGT 

Phe Oly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr He Asp He Gly 

165 170 3^75 

tT^ 5^"^ ir '^^^ ''^^ ^GC TTC GGG GGT ATA ATG CAA 

He Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly He Met Gin 
IBO 185 190 

ATG AAG TTT TGC CCA GAA TAT GTA AGC GTA TTT AAT AAT GTT CAA GAA 
Met Lys Pne Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
195 200 205 

AAC AAA GGC GCA AGT ATA TTT AAT AGA CGT GGA TAT TTT TCA GAT CCA 
Asn Lys Gly Ala Ser He Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 
-10 215 220 

GCC TTG ATA TTA ATG CAT GAA CTT ATA CAT GTT TTA CAT GGA TTA TAT 
Ala Leu He Leu Met His Glu Leu He His Val Leu Hie Gly Leu Tyr 

230 235 240 

GGC ATT AAA GTA GAT GAT TTA CCA ATT GTA CCA AAT GAA AAA AAA TTT 
Oly He Lys Val Asp Asp Leu Pro He Val Pro Asn Glu Lys Lys Phe 
245 250 255 

TTT ATG CAA TCT ACA GAT GCT ATA CAG GCA GAA GAA CTA TAT ACA TTT 
Phe Met Gin Ser Thr Asp Ala He Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

4S n?^ n^'^ ^'^^ ^^'^ ^^'^ TCT ACG GAT AAA AGT ATC 864 

H.-^ Gly Gly Gin Asp Pro Ser He He Thr Pro Ser Thr Asp Lys Ser He 
275 280 285 



30 



40 



60 



TAT GAT AAA GTT TTG CAA AAT TTT AGA GGG ATA GTT GAT AGA CTT AAC 

Sn '^^'^ ^f? ^'-^^ ^^'^ Val Asp Arg Leu Asn 

290 295 300 



ACT AGA GCT TCT TAT TTT AGT GAT TCC TTA CCA CCA GTA AAA ATA AAA 
Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys He Lvs 
570 375 380 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



912 



960 



1008 



AAG GTT TTA GTT TGC ATA TCA GAT CCT AAC ATT AAT ATT AAT ATA TAT 
Lys Val Leu Val Cys He Ser Asp Pro Asn He Asn He Asn He Tyr 

310 315 320 

AAA AAT AAA TTT AAA GAT AAA TAT AAA TTC GTT GAA GAT TCT GAG GGA 
Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Glv 
325 330 335 ^ 

AAA TAT AGT ATA GAT GTA GAA AGT TTT GAT AAA TTA TAT AAA AGC TTA 1056 
Lys Tyr Sfer He Asp Val Glu Ser Phe Asp Lys Leu Tyr Lvs Ser Leu 
340 345 350 

ATG TTT GGT TTT ACA GAA ACT AAT ATA GCA GAA AAT TAT AAA ATA AAA 1104 
Met Phe Gly Phe Thr Glu Thr Asn He Ala Glu Asn Tyr Lys He Lys 
355 360 365 



1152 
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AAT TTA TTA GAT AAT GAA ATC TAT ACT ATA GAG GAA GGG TTT AAT ATA 1200 
Asn Leu Leu Asp Asn Glu lie Tyr Thr He Glu Glu Gly Phe Asn He 
385 390 395 400 

TCT GAT AAA GAT ATG GAA AAA GAA TAT AGA GGT CAG AAT AAA GCT ATA 1248 
Ser Asp Lys Asp Met Glu Lys Glu Tyr Arg Glv Gin Asn Lys Ala He 
405 410 ' 415 

AAT AAA CAA GCT TAT GAA GAA ATT AGC AAG GAG CAT TTG GCT GTA TAT 1296 
Asn Lys Gin Ala Tyr Glu Glu He Ser Lys Glu His Leu Ala Val Tyr 
420 425 430 

AAG ATA CAA ATG TGT AAA AGT GTT AAA GCT CCA GGA ATA TGT ATT GAT 1344 
Lys He Gin Met Cys Lys Ser Val Lys Ala Pro Gly He Cys He Asp 
1-^ 435 440 445 ^ 



10 



20 



GTT GAT AAT GAA GAT TTG TTC TTT ATA GCT GAT AAA AAT AGT TTT TCA 1392 
Val Asp Asn Glu Asp Leu Phe Phe He Ala Asp Lys Asn Ser Phe Ser 
450 455 460 

GAT GAT TTA TCT AAA AAC GAA AGA ATA GAA TAT AAT ACA CAG AGT AAT 1440 
Asp Asp Leu Ser Lys Asn Glu Arg He Glu Tvr Asn Thr Gin Ser Asn 
^^^5 470 475 480 

25 TAT ATA GAA AAT GAC TTC CCT ATA AAT GAA TTA ATT TTA GAT ACT GAT 1488 

Tyr He Glu Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Aso 
485 490 495 

^ TTA ATA AGT AAA ATA GAA TTA CCA AGT GAA AAT ACA GAA TCA CTT ACT 1536 

M) Leu He Ser Lys He Glu Leu Pro ser Glu Asn Thr Glu Ser Leu Thr 
500 505 510 

GAT TTT AAT GTA GAT GTT CCA GTA TAT GAA AAA CAA CCC GCT ATA AAA 1584 
^ Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gin Pro Ala He Lys 

-'^^ 515 520 525 



40 



50 



60 



65 



70 



AAA ATT TTT ACA GAT GAA AAT ACC ATC TTT CAA TAT TTA TAC TCT CAG 1632 

Lys He Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tyr Ser Gin 
530 535 540 

ACA TTT CTC TTA GAT ATA AGA GAT ATA AGT TTA ACA TCT TCA TTT GAT 1680 

Thr Phe Leu Leu Asp He Arg Asp He Ser Leu Thr Ser Ser Phe Asp 
545 550 555 560 



45 GAT GCA TTA TTA TTT TCT AAC AAA GTT TAT TCA TTT TTT TCT ATG GAT 

Asp Ala Leu Leu Phe Ser Asn Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 57B 



1728 



TAT ATT AAA ACT GCT AAT AAA GTG GTA GAA GCA GGA TTA TTT GCA GGT 17 76 

Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly 
580 585 590 



TGG GTG AAA CAG ATA GTA AAT GAT TTT GTA ATC GAA GCT AAT AAA AGC 1824 
Trp Val Lys Gin He Val Asn Asp Phe Val He Glu Ala Asn Lys Ser 
-^-^ 595 600 605 



AAT ACT ATG GAT AAA ATT GCA GAT ATA TCT CTA ATT GTT CCT TAT ATA 1872 
Asn Thr Met Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr He 
610 615 620 

GGA TTA GCT TTA AAT GTA GGA AAT GAA ACA GCT AAA GGA AAT TTT GAA 1920 
Gly Leu Ala Leu Asn Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu 
625 630 635 640 

AAT GCT TTT GAG ATT GCA GGA GCC AGT ATT CTA CTA GAA TTT ATA CCA 1968 
Asn Ala Phe Glu He Ala Gly Ala Ser He Leu Leu Glu Phe He Pro 
645 650 655 

GAA CTT TTA ATA CCT GTA GTT GGA GCC TTT TTA TTA GAA TCA TAT ATT 2016 
Glu Leu Lou He Pro Val Val Gly Ala Phe Leu Leu Glu Ser Tyr He 
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660 665 



6 70 



GAC AAT AAA AAT AAA ATT ATT AAA ACA ATA GAT AAT GCT TTA ACT AAA 
ASP Asn Lys Asn Lys He He Lys Thr lie Asp Asn Ala Leu Thr LyJ 

680 685 

AGA AAT GAA AAA TGG AGT GAT ATG TAC GGA TTA ATA GTA GCG CAA TGG 
Arg Asn Glu Lys Trp Ser Asp Met Tyr Gly Leu lie Val Ala Gin Trp 

695 700 

etc TCA ACA GTT AAT ACT CAA TTT TAT ACA ATA AAA GAG GGA ATG TAT 
Leu ser Thr Val Asn Thr Gin Phe Tyr Thr lie Lys Glu Gly Met Tyr 

AAG GCT TTA AAT TAT CAA GCA CAA GCA TTG GAA GAA ATA ATA AAA TAC 
uys Ala Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu He lie Lys Tyr 

725 730 735 

AGA TAT AAT ATA TAT TCT GAA AAA GAA AAG TCA AAT ATT AAC ATC GAT 
Arg Tyr Asn lie Tyr Ser Glu Lys Glu Lys Ser Asn He Asn lie Asp 
''^O 745 750 

TTT AAT GAT ATA AAT TCT AAA CTT AAT GAG GGT ATT AAC CAA GCT ATA 
Phe Asn Asp He Asn Ser Lys Leu Asn Glu Gly He Asn Gin Ala He 
^55 760 765 

GAT AAT ATA AAT AAT TTT ATA AAT GGA TGT TCT GTA TCA TAT TTA ATG 
Asp Asn He Ann Asn Phe He Asn Gly Cys Ser Val Ser Tyr Leu Met 
''0 775 780 

/\AA AAA ATG ATT CCA TTA GCT GTA GAA AAA TTA CTA GAC TTT GAT AAT 
-ys I.ys Met He Pro Leu Ala Val Glu Lys Leu Leu Asp Phe Asp Asn 

790 795 800 

ACT CTC AAA AAA AAT TTG TTA AAT TAT ATA GAT GAA AAT AAA TTA TAT 
Thr Leu Lys Lys Asn Leu Leu Asn Tyr He Asp Glu Asn Lys Leu Tyr 
805 810 815 

TTG ATT GGA AGT GCA GAA TAT GAA AAA TCA AAA GTA AAT AAA TAC TTG 
i^eu He Gly Ser Ala Glu Tyr Glu Lys Ser Lys Val Asn Lys Tyr Leu 
820 825 830 

AAA ACC ATT ATG CCG TTT GAT CTT TCA ATA TAT ACC AAT GAT ACA ATA 2 544 

Lys Thr He Met Pro Phe Asp Lou Ser He Tyr Thr Asn Asp Thr He 
^35 840 845 

CTA ATA GrvA ATG TTT AAT AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT 
Leu He Glu Met Phe Asn Lys Tyr Asn Ser Glu He Leu Asn Asn He 
850 855 860 

ATC TTA AAT TTA AGA TAT AAG GAT AAT AAT TTA ATA GAT TTA TCA GGA 
He Leu Asn Leu Arg Tyr Lys Asp Asn Asn Leu He Asp Leu Ser Glv 

870 875 880 

TAT GGG GCA AAG GTA GAG GTA TAT GAT GGA GTC GAG CTT AAT GAT AAA 
Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp Lys 
885 890 895 

AAT CAA TTT AAA TTA ACT AGT TCA GCA AAT AGT AAG ATT AGA GTG ACT 
Asn Gin Phe Lys Leu Thr Ser Ser Ala Asn Ser Lys He Arg Val Thr 
900 905 910 

CAA AAT CAG AAT ATC ATA TTT AAT AGT GTG TTC CTT GAT TTT AGC GTT 
Gin Asn Gin Asn He He Phe Asn Ser Val Phe Leu Asp Phe Ser Val 
915 920 925 

AGC TTT TGG ATA AGA ATA CCT AAA TAT AAG AAT GAT GGT ATA CAA AAT 
Ser Phe Trp He Arg He Pro Lys Tyr Lys Asn Asp Gly He Gin Asn 
930 935 



2064 



2112 



2160 



2208 



;!256 



2304 



2352 



2400 



2448 



2496 



2592 



2640 



2688 



2736 



2784 



2832 
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10 



20 



.10 



40 



50 



60 



TAT ATT CAT AAT GAA TAT ACA ATA ATT AAT TGT ATG AAA AAT AAT TCG 2 880 

Tyr He His Asn Glu Tyr Thr He He Asn Cys Met Lys Asn Asn Ser 

950 955 960 

GGC TGG AAA ATA TCT ATT AGG GGT AAT AGG ATA ATA TGG ACT TTA ATT 2928 
Gly Trp Lys He Ser He Arg Gly Asn Arg He He Trp Thr Leu He 
965 970 975 

GAT ATA AAT GGA AAA ACC AAA TCG GTA TTT TTT GAA TAT AAC ATA AGA 9976 
Asp He Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn He Arg 
980 985 990 



GAA GAT ATA TCA GAG TAT ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT 3024 
Glu Asp He Ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr 
995 1000 1005 



3120 



AAT AAT TTG AAT AAC GCT AAA ATT TAT ATT AAT GGT AAG CTA GAA TCA 3072 
Asn Asn Leu Asn Asn Ala Lys He Tyr He Asn Gly Lys Leu Glu Ser 
1010 1015 1020 

AAT ACA GAT ATT AAA GAT ATA AGA GAA GTT ATT GCT AAT GGT GAA ATA 
Asn Thr Asp He Lys Asp He Arg Glu Val He Ala Asn Gly Glu He 
^^25 1030 1035 1040 

ATA TTT AAA TTA GAT GGT GAT ATA GAT AGA ACA CAA TTT ATT TGG ATG 3168 
He Phe Lys Leu Asp Gly Asp He Asp Arg Thr Gin Phe He Trp Met 
1045 1050 1055 

AAA TAT TTC AGT ATT TTT AAT ACG GAA TTA AGT CAA TCA AAT ATT GAA 3 "Jig 

Lys Tyr Phe Ser He Phe Asn Thr Glu Leu Ser Gin Ser Asn He Glu 
1060 1065 1070 

GAA AGA TAT AAA ATT CAA TCA TAT AGC GAA TAT TTA AAA GAT TTT TGG 
Glu Arg Tyr Lys He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trn 
-^^ 1075 1080 1085 

GGA AAT CCT TTA ATG TAC AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG 3312 
Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly 
1090 1095 1100 

AAT AAA AAT TCA TAT ATT AAA CTA AAG AAA GAT TCA CCT GTA GGT GAA 3360 
Asn Lys Asn Ser Tyr He Lye Leu Lys Lys Asp Ser Pro Val Gly Glu 
1105 1110 1115 ^^20 

45 ATT TTA ACA CGT AGC AAA TAT AAT CAA AAT TCT AAA TAT ATA AAT TAT 34 08 

He Leu Thr Arg Ser Lys Tyr Asn Gin Asn Ser Lvs Tyr He Asn Tyr 
1125 1130 * 1135 



3264 



AGA GAT TTA TAT ATT GGA GAA AAA TTT ATT ATA AGA AGA AAG TCA AAT 34 56 

Arg Asp Leu Tyr He Gly Glu Lys Phe He He Arg Arg Lys Ser Asn 
1140 1145 1150 

TCT CAA TCT ATA AAT GAT GAT ATA GTT AGA AAA GAA GAT TAT ATA TAT 3 504 

Ser Gin Ser He Asn Asp Asp He Val Arg Lys Glu Asp Tyr He Tvr 
1155 1160 1165 

CTA GAT TTT TTT AAT TTA AAT CAA GAG TGG AGA GTA TAT ACC TAT AAA 3 552 

Leu Asp Phe Phe Asn Leu Acn Gin Glu Trp Arg Val Tyr Thr Tyr Lvs 
1170 1175 1180 

TAT TTT AAG AAA GAG GAA GAA AAA TTG TTT TTA GCT CCT ATA AGT GAT 
Tyr Phe Lys Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro He Ser Asd 
1185 1190 1195 1200 

65 TCT GAT GAG TTT TAC AAT ACT ATA CAA ATA AAA GAA TAT GAT GAA CAG 3648 

Ser Asp Glu Phe Tyr Asn Thr He Gin He Lys Glu Tyr Asp Giu Gin 
1205 1210 1215 



3600 
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'p^^ c^'^ '^^'^ ^'^'^ '^'^'^ ^ AAA GAT GAA GAA AGT ACT 3696 

Pro Thr Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 
1220 1225 1230 

GAT GAG ATA GGA TTG ATT GGT ATT CAT CGT TTC TAC GAA TCT GGA ATT 3744 
Asp Glu He Gly Leu He Gly He His Arg Phe Tyr Glu Ser Gly He 
1235 1240 1245 

GTA TTT GAA GAG TAT AAA GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA 3792 
Val Phe Glu Glu Tyr Lys Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu 
1250 1255 1260 

AAA GAG GTA AAA AGG AAA CCA TAT AAT TTA AAA TTG GGA TGT AAT TGG 3840 
Lys Glu Val Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp 
1265 1270 1275 1280 

CAG TTT ATT CCT AAA GAT GAA GGG TGG ACT GAA TAA 107^ 
Gin Phe He Pro Lys Asp Glu Gly Trp Thr Glu 
1285 1290 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Met Pro Val Thr He Asn Asn Phe Asn Tyr Asn Asp Pro He Asp Asn 
^5 10 15 

Asn Asn He He Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg 
20 25 30 

Tyr Tyr Lys Ala Phe Lys He Thr Asp Arg He Trp He He Pro Glu 
35 40 45 

Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 
50 55 60 

He Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 

65 70 75 80 

Thr Asn Asp Lys Lya Asn He Phe Leu Gin Thr Met He Lys Leu Phe 
85 90 95 

Asn Arg He Lys Ser Lys Pro Leu Glv Glu Lys Leu Leu Glu Met He 
100 105 1x0 

He Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 
115 120 125 

Phe Asn Thr Asn He Ala Ser Val Thr Val Asn Lys Leu He Ser Asn 
130 13S 140 

Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn Leu He He 
145 150 155 X60 
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Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr lie Asp He Gly 
165 170 175 

He Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly He Met Gin 
180 185 190 

Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
195 200 205 

Asn Lys Gly Ala Ser He Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 
210 215 220 

Ala Lou He Leu Met His Glu Leu He His Val Leu His Gly Leu Tvr 
-25 230 235 240 

Gly He Lys Val Asp Asp Leu Pro He Val Pro Asn Glu Lys Lys Phe 
245 250 255 

Phe Met Gin Ser Thr Asp Ala He Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

Gly Gly Gin Asp Pro Ser He He Thr Pro Ser Thr Asp Lys Ser He 
275 280 285 

Tyr Asp Lys Val Leu Gin Asn Phe Arg Gly He Val Asp Arg Leu Asn 
290 295 300 

Lys Val Leu Val Cys He Ser Asp Pro Asn He Asn He Asn He Tvr 
305 310 315 320 

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Glv 
325 330 335 

Lys Tyr Ser He Asp Val Glu Ser Phe Asp Lys Leu Tvr Lys Ser Leu 
340 345 ' 

Met Phe Gly Phe Thr Glu Thr Asn He Ala Glu Asn Tyr Lys He Lvs 
355 360 365 

Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys He Lvs 

370 375 380 

Asn Leu Leu Asp Asn Glu He Tyr Thr He Glu Glu Gly Phe Asn He 
385 390 395 400 

Ser Asp Lys Asp Met Glu Lys Glu Tyr Arg Glv Gin Asn Lvs Ala He 
405 410 " ' 41S 

Asn Lys Gin Ala Tyr Glu Glu He Ser Lys Glu His Leu Ala Val Tyr 
420 425 430 

Lys He Gin Met Cys Lys Ser Val Lys Ala Pro Glv He Cys He Asp 
435 440 '445 

Val Asp Asn Glu Asp Leu Phe Phe He Ala Asp Lys Asn Ser Phe Ser 
450 455 460 

Asp Asp Leu Ser Lys Asn Glu Arg He Glu Tyr Asn Thr Gin Ser Asn 
^^^^5 470 475 480 

Tyr He Glu Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Asp 
485 490 495 

Leu He Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 
500 505 510 

Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gin Pro Ala He Lvs 
515 520 525 

Lys He Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tyr Ser Gin 
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530 535 



540 



Thr Phe Leu Leu Asp He Arg Asp lie Ser Leu Thr Ser Ser Phe Asp 
^ 555 

Asp Aia Leu Leu Phe Ser Asn Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 

Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Glv 

585 590 

Trp Val Lys Gin He Val Asn Asp Phe Val He Glu Ala Asn I,ys Ser 
595 600 605 

Asn Thr Met Asp Lys He Ala Asp He Ser Lou He Val Pro Tyr He 

615 620 

Gly Leu Ala Leu Asn Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu 

630 635 640 

Asn Ala Phe Glu He Ala Gly Ala Ser He Leu Leu Glu Phe He Pro 
645 650 655 

Glu Leu Leu He Pro Val Val Gly Ala Phe Leu Lou Glu Ser Tyr He 
660 665 670 

Asp Asn Lys Asn Lys He He Lys Thr He Asp Asn Ala Leu Thr Lys 
6"^5 680 685 

Arg Asn Glu Lys Trp Ser Asp Met Tyr Gly Leu He Val Ala Gin Trp 
690 695 700 

Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He Lys Glu Gly Met Tyr 

715 720 



Lys AJa Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu He He Lv; Tvr 
725 730 735 ' 

Arg Tyr Asn He Tyr Ser Glu Lys Glu Lys Ser Asn He Aon He Asp 
V40 745 750 

Phe Asn Asp He Asn Ser Lys Leu Asn Glu Gly He Asn Gin Ala He 
755 760 765 

Asp Asn He Asn Asn Phe He Asn Gly Cys Ser Val Ser Tyr Leu Met 
770 775 780 

Lys Lys Met He Pro Leu Ala Val Glu Lys Leu Leu Asp Phe Asp Asn 

790 795 800 

Thr Leu Lys Lys Asn Leu Leu Asn Tyr He Asp Glu Asn Lys Leu Tyr 
805 810 815 

Leu He Gly Ser Ala Glu Tyr Glu Lys Ser Lys Val Asn Lys Tyr Leu 
820 825 830 

Lys Thr He Met Pro Phe Asp Leu Ser He Tyr Thr Asn Asp Thr He 
835 840 845 

Leu He Glu Met Phe Asn Lys Tyr Asn Ser Glu He Leu Asn Asn He 

850 855 860 

He Leu Asn Leu Arg Tyr Lys Asp Asn Asn Leu He Asp Leu Ser Glv 
®^5 870 875 880 

Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp Lvs 
885 890 895 ' 

Asn Gin Phe Lys Leu Thr Ser Ser Ala Asn Ser Lys He Arg Val Thr 
900 905 910 
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Gin Asn Gin Asn He He Phe Asn Ser Val Phe Leu Asp Phe Ser Val 
915 920 925 

Ser Phe Trp He Arg lie Pro Lys Tyr Lys Asn Asp Gly He Gin Asn 
^30 935 g^Q 

Tyr He His Asn Glu Tyr Thr He He Asn Cys Met Lys Asn Asn Ser 

950 955 960 

Gly Trp Lys He Ser He Arg Gly Asn Arg He He Trp Thr Leu He 
965 970 975 

Asp He Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn He Arq 
980 985 990 

Glu Asp He ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr 
^95 1000 lOOS 

Asn Asn^Leu Asn Asn Ala ^.ys^He Tyr He Asn Gly Lys Leu Glu Ser 

Asn Thr Asp He Lys Asp He Arg Glu Val He Ala Asn Gly Glu He 

1030 1035 1040 

He Pne Lys Leu Asp Gly Asp He Asp Arg Thr Gin Phe He Trp Met 
1045 1050 1055 

Lys Tyr Phe Ser He Phe Asn Thr Glu Leu Ser Gin Ser Asn He Glu 
1060 1065 1070 

Ulu Arg Tyr Lys He Gin Ser Tyr Ser Glu Tyr Leu Lvs Asp Phe Trp 
10^5 1080 1085 

^^noo'''^*' '^'^'^ Tyr Met Phe Asn Ala Gly 

1095 1100 

Asn Lys Asn Ser Tyr He Lys Leu Lya Lys Asp Ser Pro Val Gly Glu 
^^0^ 1110 HIS 1120 

He Leu Thr Arg ser Lys Tyr Asn Gin Asn Ser Lys. Tyr He Asn Tyr 

1125 1130 

Arg Asp Leu Tyr He Gly Glu Lys Phe He He Arg Arg Lys Ser Asn 
1140 1145 1150 

Ser Gin Ser He Asn Asp Asp He Val Arg Lys Glu Asp Ty>- He Tvr 

1155 1160 1165 

Leu Asp Phe Phe Asn Leu Asn Gin Glu Trp Arg Val Tyr Thr Tyr Lys 
■••1'" 1175 1180 

Tyr Phe Lys Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro He Ser Asd 
^^^^ 1190 1195 1200 

ser Asp Glu Phe Tyr Asn Thr He Gin He Lys Glu Tyr Asp Glu Gin 
1205 1210 1215 

Pro Thr Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 
1220 1225 1230 

Asp Glu He Gly Leu He Gly He His Arg Phe Tyr Glu Ser Gly He 
1235 1240 1245 

t*UL°^" '^^^ "^y"" ^y'' Phe Cys He Ser Lys Trp Tyr Leu 

12SS 1260 

Lys Glu val Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp 

1270 1275 1280 

Gin Phe He Pro Lys Asp Glu Gly Trp Thr Glu 
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lie Asn Arg Trp Phe Phe Val Thr lie Thr Asn Asn Leu Asp Asn Ala 
180 185 190 195 

AAA ATT TAT ATT AAT GGC ACG TTA GAA TCA AAT ATG GAT ATT AAA GAT 
Lys He Tyr He Asn Gly Thr Leu Giu Ser Asn Met Asp He Lys Asp 
200 205 210 

ATA GGA GAA GTT ATT GTT AAT GGT GAA ATA ACA TTT AAA TTA GAT GGT 
He Gly Glu Val He Val Asn Gly Glu He Thr Phe Lys Leu Asp Gly 
215 220 225 

CAT GTA GAT AGA ACA CAA TTT ATT TGG ATG AAA TAT TTT AGT ATT TTT 
Asp Val Asp Arg Thr Gin Phe lie Trp Met Lys Tvr Phe Ser He Phe 
230 235 240 

AAT ACG CAA TTA AAT CAA TCA AAT ATT AAA GAG ATA TAT AAA ATT CAA 
Asn Thr Gin Leu Asn Gin Ser Asn He Lys Glu He Tyr Lys He Gin 
245 250 255 

TCA TAT AGC GAA TAC TTA AAA GAT TTT TGG GGA AAT CCT TTA ATG TAT 
Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro Leu Met Tyr 
^oO 265 270 275 

AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG AAT AAA AAT TCA TAT ATT 
Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lvs Asn Ser Tyr He 
280 285 290 

AAA CTA GTG AAA GAT TCA TCT GTA GGT GAA ATA TTA ATA CGT AGC AAA 
Lys Leu Val Lys Asp Ser Ser Val Gly Glu He Leu He Arg Ser Lvs 
295 300 305 

TAT AAT CAG AAT TCC AAT TAT ATA AAT TAT AGA AAT TTA TAT ATT GGA 
Tyr Asn Gin Asn Ser Asn Tyr He Asn Tyr Arg Asn Leu Tyr He Giv 

315 320 

GAA AAA TTT ATT ATA AGA AGA GAG TCA AAT TCT CAA TCT ATA AAT GAT 
Glu Lys Phe He He Arg Arg Glu Ser Asn Ser Gin Sor He Asn Asp 
^-fcS 330 335 

GAT ATA GTT AGA AAA GAA GAT TAT ATA CAT CTA GAT TTG GTA CTT CAC 
Asp lie Val Arg Lys Glu Asp Tyr He His Leu Asp Leu Val Leu His 

345 350 355 

CAT GAA GAG TGG AGA GTA TAT GCC TAT AAA TAT TTT AAG GAA CAG GAA 
His Glu Glu Trp Arg Val Tyr Ala Tyr Lys Tyr Phe Lys Glu Gin Glu 
360 365 370 

GAA AAA TTG TTT TTA TCT ATT ATA AGT GAT TCT AAT GAA TTT TAT AAG 
Glu Lys Leu Phe Leu Ser He He Ser Asp Ser Asn Glu Phe Tvr Lvs 
375 380 385 

ACT ATA GAA ATA AAA GAA TAT GAT GAA CAG CCA TCA TAT AGT TGT CAG 
Thr He Glu He Lys Giu Tyr Asp Glu Gin Pro Ser Tyr Ser Cys Gin 
390 395 400 

TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT GAT GAT ATA GGA TTG ATT 
Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Asp He Gly Leu He 
405 410 415 

GGT ATT CAT CGT TTC TAC GAA TCT GGA GTT TTA CGT AAA AAG TAT AAA 
Gly He His Arg Phe Tyr Glu Ser Gly Val Leu Arg Lys Lys Tyr Lys 

425 430 435 

GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA AAA GAG GTA AAA AGG AAA 
Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu Lys Glu Val Lys Arq Lys 
440 445 450 

CCA TAT AAG TCA AAT TTG GGA TGT AAT TGG CAG TTT ATT CCT AAA GAT 
Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp Gin Phe He Pro Lys Asd 
455 460 465 



740 



788 



836 



884 



932 



980 



1028 



1076 



1124 



117; 



1220 



1268 



1316 



1364 



1412 



1460 



1508 
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GAA GGG TGG ACT GAA TAA 
Glu Gly Trp Thr Glu 
470 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:44: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
1 5 10 15 

lie Glu Gly Arg His Met Ala Ser Met Ala Asp Thr lie Leu lie Glu 
20 25 30 

Met Phe Asn Lys Tyr Asn Ser Glu He Leu Asn Asn He He Leu Asn 
35 40 45 

Leu Arg Tyr Arg Asp Asn Asn Leu lie Asp Leu Ser Gly Tyr Glv Ala 
50 55 60 

Lys Vai Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys Asn Gin Phe 
°5 70 75 80 

Lys Leu Thr Ser Ser Ala Asp Ser Lys He Arg Val Thr Gin Asn Gin 
85 90 95 

Asn He He Phe Asn Ser Met Phe Leu Asp Phe Ser Val Ser Phe Trp 
100 105 HO 

He Arg He Pro Lyn Tyr Arg Asn Asp Asp He Gin Asn Tyr He His 
115 120 125 

Asn Giu Tyr Thr He He Asn Cys Met Lys Asn Asn Ser Gly Trp Lys 
130 135 140 

He Ser He Arg Gly Asn Arg He He Trp Thr Leu He Asp He Asn 

150 155 160 

Cfly Lvs Thr Lys Ser Val Phe Phe Glu Tyr Asn He Arg Glu Asp He 
165 170 175 

Ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr Asn Asn Leu 
180 185 190 

Asp Asn Ala Lys He Tyr He Asn Gly Thr Leu Glu Ser Asn Met Asp 
195 200 205 

He Lys Asp He Gly Glu Val He Val Asn Gly Glu He Thr Phe Lys 
210 215 220 

Leu Asp Gly Asp Val Asp Arg Thr Gin Phe He Trp Met Lvs Tvr Phe 
225 230 235 * ' 240 

Ser He Phe Asn Thr Gin Leu Asn Gin Ser Asn He Lvs Glu He Tyr 
245 250 ' 255 

Lys He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro 
260 265 270 

Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn 
275 280 285 

Ser Tyr He Lys Leu Val Lys Asp Ser Ser Val Gly Glu Ho Leu He 
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290 295 300 

Arg Ser Lys Tyr Asn Gin Asn Ser Asn Tyr He Asn Tyr Arg Asn Leu 
305 310 315 320 

Tyr He Gly Glu Lys Phe He He Arg Arg Glu Ser Asn Ser Gin Ser 
325 330 335 

He Asn Asp Asp He Val Arg Lys Glu Asp Tyr He His Leu Asp Leu 
340 345 350 

Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr Lys Tyr Phe Lys 
355 360 365 

Glu Gin Glu Glu Lys Leu Phe Leu Ser He He Ser Asp Ser Asn Glu 
370 375 380 

Phe Tyr Lys Thr He Glu He Lys Glu Tvr Asp Glu Gin Pro Ser Tyr 
390 395 400 

Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Asp He 
405 410 415 

Gly Leu He Gly He His Arg Phe Tyr Glu Ser Gly Val Leu Arg Lys 
420 425 430 

Lys Tyr Lys Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu Lys Glu Val 
435 440 445 

Lys Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp Gin Phe lie 
450 455 460 

Pro Lys Asp Glu Gly Trp Thr Glu 
465 470 

<2) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 108.. 1523 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4 5: 

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 
His His His His His His His His His Ser Ser Gly His He Glu Gly 
5 10 15 

CGT CAT ATG GCT AGC ATG GCT GAT ACA ATA CTA ATA GAA ATG TTT AAT 
Arg His Met Ala Ser Met Ala Asp Thr He Leu He Glu Met Phe Asn 
20 25 30 35 

AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT ATC TTA AAT TTA AGA TAT 
Lys Tyr Asn Ser Glu He Leu Asn Asn He He Leu Asn Leu Arq Tvr 
40 45 50 

AAG GAT AAT AAT TTA ATA GAT TTA TCA GGA TAT GGG GCA AAG GTA GAG 
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Lys Asp Asn Asn Leu He Asp Leu Ser Gly Tyr Gly Ala Lys Val Glu 

55 60 65 

vll Itl ttl n?"" ^^'^ ^'"'^ ^ TTT AAA TTA ACT 

Val Tyr Asp Gly Val Glu Leu Asn Asp Lys Asn Gin Phe Lys Leu Thr 

75 80 

AGT TCA GCA AAT AGT AAG ATT AGA GTG ACT CAA AAT CAG AAT ATC ATA 
Ser Ser Ala Asn Ser Lys lie Arg Val Thr Gin Asn Gin Asn He He 
«5 90 95 

TTT AAT AGT GTG TTC CTT GAT TTT AGC GTT AGC TTT TGG ATA ACA ATA 
Phe Asn ser Val Phe Leu Asp Phe Ser Val Ser Phe Trp He Arg He 

CCT AAA TAT AAG AAT GAT GGT ATA CAA AAT TAT ATT CAT AAT GAA TAT 
Pro Lys Tyr Lys Asn Asp Gly He Gin Asn Tyr He His Asn Glu Tyr 
120 125 130 

ACA ATA ATT AAT TGT ATG AAA AAT AAT TCG GGC TGG AAA ATA TCT ATT 
Thr He He Asn Cys Met Lys Asn Asn Ser Gly Trp Lys He Ser He 
135 140 3^45 

AGG GGT AAT AGG ATA ATA TGG ACT TTA ATT GAT ATA AAT GGA AAA ACC 
Arg Gly Asn Arg He He Trp Thr Leu He Asp lie Asn Gly Lys Thr 
150 155 160 

AAA TCG GTA TTT TTT GAA TAT AAC ATA AGA GAA GAT ATA TCA GAG TAT 
Lys Ser Vnl Phe Phe Glu Tyr Asn He Arg Glu Asp He Ser Glu Tyr 
1€5 170 

ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT AAT AAT TTG AAT AAC GCT 
He Asn Arg Trp Phe Phe Val Thr He Thr Asn Asn Leu Asn Asn Ala 

185 190 

AAA ATT TAT ATT AAT GGT AAG CTA GAA TCA AAT ACA GAT ATT AAA GAT 
Lys He Tyr He Asn Gly Lys Leu Glu Ser Asn Thr Asp He Lvs Asp 
200 205 210 

ATA AGA GAA GTT ATT GCT AAT GGT GAA ATA ATA TTT AAA TTA GAT GGT 
He Arg Glu Val He Ala Asn Gly Glu He He Phe Lys Leu Asp Gly 
215 220 225 

(;AT ATA GAT AGA ACA CAA TTT ATT TGG ATG AAA TAT TTC AGT ATT TTT 
Asp He Asp Arg Thr Gin Phe He Trp Met Lys Tvr Phe Ser He Phe 
230 235 ' 240 

AAT ACG GAA TTA AGT CAA TCA AAT ATT GAA GAA ACA TAT AAA ATT CAA 
Asn Thr Glu Leu Ser Gin Ser Asn He Glu Glu Arg Tyr Lys He Gin 
245 250 255 

TCA TAT AGC GAA TAT TTA AAA GAT TTT TGG GGA AAT CCT TTA ATG TAC 
Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro Leu Met Tvr 
260 265 270 275 

AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG AAT AAA AAT TCA TAT ATT 
Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn Ser Tyr He 
280 285 290 

AAA CTA AAG AAA GAT TCA CCT GTA GGT GAA ATT TTA ACA CGT AGC AAA 
Lys Leu Lys Lys Asp Ser Pro Val Gly Glu He Leu Thr Arg Ser Lys 
295 300 305 

TAT AAT CAA AAT TCT AAA TAT ATA AAT TAT AGA GAT TTA TAT ATT GGA 
Tyr Asn Gin Asn Ser Lys Tyr He Asn Tyr Arg Asp Leu Tyr He Gly 
310 315 320 
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GAA AAA TTT ATT ATA AGA AGA AAG TCA AAT TCT CAA TCT ATA AAT GAT 1124 
Glu Lys Phe He He Arg Arg Lys Ser Asn Ser Gin Ser He Asn Asp 
325 330 335 

GAT ATA GTT AGA AAA GAA GAT TAT ATA TAT CTA GAT TTT TTT AAT TTA 1172 
Asp He Val Arg Lys Glu Asp Tyr He Tyr Leu Asp Phe Phe Asn Leu 
340 345 350 355 

AAT CAA GAG TGG AGA GTA TAT ACC TAT AAA TAT TTT AAG AAA GAG GAA 1220 
Asn Gin Glu Trp Arg Val Tyr Thr Tyr Lys Tyr Phe Lys Lys Glu Glu 
360 365 370 

GAA AAA TTG TTT TTA GCT CCT ATA AGT GAT TCT GAT GAG TTT TAC AAT 1268 
Glu Lys Leu Phe Leu Ala Pro He Ser Asp Ser Asp Glu Phe Tyr Asn 
375 380 385 

ACT ATA CAA ATA AAA GAA TAT GAT GAA CAG CCA ACA TAT AGT TGT CAG 1316 
Thr He Gin He Lys Glu Tyr Asp Glu Gin Pro Thr Tyr Ser Cys Gin 
390 395 400 

TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT GAT GAG ATA GGA TTG ATT 1364 
Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Glu He Gly Leu He 
405 410 415 

25 GGT ATT CAT CGT TTC TAC GAA TCT GGA ATT GTA TTT GAA GAG TAT AAA 1412 

Gly He His Arg Phe Tyr Glu Ser Gly He Val Phe Glu Glu Tyr Lys 
**20 425 430 435 



10 



15 



20 



30 



40 



60 



GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA AAA GAG GTA AAA AGG AAA 14 60 

Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu Lys Glu Val Lys Arg Lys 
440 445 450 



CCA TAT AAT TTA AAA TTG GGA TGT AAT TGG CAG TTT ATT CCT AAA GAT 1508 
Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp Gin Phe He Pro Lys Asp 
-^^ 455 460 465 



CAA GGG TGG ACT GAA TAAAAGCTTG CGGCCGCACT CGAG 154 7 

Glu Gly Trp Thr Glu 

470 

(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 Ui) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
15 10 15 

S:^ He Glu Gly Arg His Met Ala Ser Met Ala Asp Thr He Leu He Glu 

20 25 30 



Met Phe Asn Lys Tyr Asn Ser Glu He Leu Asn Asn He He Leu Asn 
35 40 45 

Leu Arg Tyr Lys Asp Asn Asn Leu He Asp Leu Ser Gly Tvr Gly Ala 
50 55 60 



Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp Lys Asn Gin Phe 
0> 65 70 75 80 



70 



Lys Leu Thr Ser Ser Ala Asn Ser Lys He Arg Val Thr Gin Asn Gin 
85 90 95 

Asn He He Phe Asn Ser Val Phe Leu Asp Phe Ser Val Ser Phe Trp 
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100 



105 



110 



10 



15 



20 



2:> 



30 



35 



40 



45 



50 



60 



()5 



70 



He Arg He Pro Lys Tyr Lys Asn Asp Gly He Gin Asn Tyr He His 
115 120 125 

Asn Glu Tyr Thr He He Asn Cys Met Lys Asn Asn Ser Giy Trp Lvs 
130 135 140 ^ . 

He Ser He Arg Gly Asn Arg He He Trp Thr Leu lie Asp He Asn 
"5 150 155 160 

Gly Ly£3 Thr Lys Ser Val Phe Phe Glu Tyr Asn He Arg Giu A§p lie 
165 170 175 

ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr Asn Asn Leu 
180 185 190 

Asn Asn Ala Lys He Tyr He Asn Gly Lys Leu Glu Ser Asn Thr Asd 
195 200 205 

He Lys Asp He Arg Glu Val He Ala Asn Gly Glu He He Phe Lvs 
210 215 220 

Leu Asp Giy Asp He Asp Arg Thr Gin Phe He Trp Met Lys Tyr Phe 
•^^^ 230 235 ' 240 

.Ser He Phe Asn Thr Glu Leu Ser Gin Ser Asn He Glu Glu Arg Tyr 
245 250 255 

Lys He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro 
260 265 270 

Leu Met Tyr Asn Lyo Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn 
275 280 285 

Ser Tyr He Lys Leu Lys Lys Asp Ser Pro Val Glv Glu He Leu Thr 
-90 295 300 

Arg Ser Lys Tyr Asn Gin Asn Ser Lys Tyr He Aon Tyr Arg Asp Leu 
^05 310 315 320 

Tyr He Gly Glu Lys Phe He He Arg Arg Lys Ser Asn Ser Gin Ser 
325 330 335 

lie Asn Asp Asp He Val Arg Lys Glu Asp Tyr He Tyr Leu Asp Phe 
340 345 350 

Phe Asn Ltiu Asn Gin Giu Trp Arq Val Tyr Thr Tyr Lys Tyr Phe Lvs 

355 360 365 

Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro He Ser Asp Ser Asp Glu 
370 375 380 

Phe Tyr Asn Thr He Gin He Lys Glu Tyr Asp Glu Gin Pro Thr Tvr 

390 395 400 

Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asd Glu He 
405 410 " 415 

Gly Leu lie Gly He His Arg Phe T>^'r Glu Ser Gly He Val Phe Glu 
420 425 430 

Glu Tyr Lys Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu Lys Giu Val 
435 440 445 

Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp Gin Phe He 
450 455 460 

Pro Lys Asp Glu Gly Trp Thr Glu 
465 470 
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30 



40 



50 



(2) INFORMATION FOR SEO ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
{B) TYPE: nucleic acid 
(CJ STRANDEDNESS : single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47; 



CGCCATGGCT GATACAATAC TAATAGAAAT G ^1 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 2 9 base pairs 
-0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

^- tii) MOLECULE TYPE: other nucleic acid 

<A) DESCRIPTION: /desc - *'DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48; 

GCAAGCTTTT ATTCAGTCCA CCCTTCATC 2 9 

(2) INFORMATION FOR SEQ ID NO:49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3753 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



Ui) MOLECULE TYPE: DNA (genomic) 

(IX) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 3750 



70 



GAT AAA TTT TTA AAA ATA GTC ACA AAA ATA TTT AAT AGA ATA A/vT GAT 
Asp Lys Phe Leu Lys lie Val Thr Lys He Phe Asn Arg He Asn Asp 
85 90 95 



48 



96 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 

ATG CCA ACA ATT AAT AGT TTT AAT TAT AAT GAT CCT GTT AAT AAT AGA 
Met Pro Thr He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asn Arg 
^5 10 15 

ACA ATT TTA TAT ATT AAA CCA GGC GGT TGT CAA CAA TTT TAT AAA TCA 
Thr He Leu Tyr He Lys Pro Gly Gly Cys Gin Gin Phe Tyr Lys Ser 
20 25 30 

55 TTT AAT ATT ATG AAA AAT ATT TGG ATA ATT CCA GAG AGA AAT GTA ATT 

Phe Asn He Met Lys Asn He Trp He He Pro Glu Arg Asn Val He 
35 40 45 

GGT ACA ATT CCC CAA GAT TTT CTT CCG CCT ACT TCA TTG AAA AAT GGA 192 
W Gly Thr He Pro Gin Asp Phe Leu Pro Pro Thr Ser Leu Lys Asn Glv 
50 55 60 

GAT AGT AGT TAT TAT GAC CCT AAT TAT TTA CAA AGT GAT CAA GAA AAG 
Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp Gin Glu Lys 
"-^ 65 70 75 80 



144 



240 



288 
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30 



50 



fiO 



65 



70 



AAT CTT TCA GGA AGG ATT TTA TTA GAA GAA CTG TCA AAA GCT AAT CCA 
Asn Leu Ser Gly Arg He Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro 
100 105 110 

TAT TTA GGA AAT GAT AAT ACT CCA GAT GGT GAC TTC ATT ATT AAT GAT 
Tyr Leu Gly Asn Asp Asn Thr Pro Asp Gly Asp Phe He He Asn Asp 
115 120 125 



336 



384 



480 



528 



576 



GCA TCA GCA GTT CCA ATT CAA TTC TCA AAT GGT AGC CAA AGC ATA CTA 432 
M) Ala ser Ala Val Pro He Gin Phe Ser Asn Gly Ser Gin Ser He Leu 
130 135 140 

TTA CCT AAT GTT ATT ATA ATG GGA GCA GAG CCT GAT TTA TTT GAA ACT 
M V!c ^""^ ^""^ Pro Asp Leu Phe Glu Thr 

150 155 160 

AAC AGT TCC AAT ATT TCT CTA AGA AAT AAT TAT ATG CCA AGC AAT CAC 
Asn Ser Ser Asn He Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 
20 170 175 

GGT TTT GGA TCA ATA GCT ATA GTA ACA TTC TCA CCT GAA TAT TCT TTT 
Gly Phe Gly Ser He Ala He Val Thr Phe Ser Pro Glu Tyr Ser Phe 
180 185 190 

2.^ AGA TTT AAA GAT AAT AGT ATG AAT GAA TTT ATT CAA GAT CCT GCT CTT 624 

Arg Phe Lys Asp Asn Ser Met Asn Glu Phe He Gin Asp Pro Ala Leu 
195 200 205 

ACA TTA ATG CAT GAA TTA ATA CAT TCA TTA CAT GGA CTA TAT GGG GCT 
Thr Leu Met His Glu Leu He His Ser Leu His Gly Leu Tyr Gly Ala 
'■^10 215 220 

AAA GGG ATT ACT ACA AAG TAT ACT ATA ACA CAA AAA CAA AAT CCC CTA 
^ Lys Gly He Thr Thr Lys Tyr Thr He Thr Gin Lys Gin Asn Pro Leu 

-'"^ 225 230 235 240 

ATA ACA AAT ATA AGA GGT ACA AAT ATT GAA GAA TTC TTA ACT TTT GGA 768 
He Thr Asn He Arg Gly Thr Asn He Glu Glu Phe Leu Thr Phe Gly 
245 250 255 

GGT ACT GAT TTA AAC ATT ATT ACT AGT GCT CAG TCC AAT GAT ATC TAT 816 
Gly Thr Asp Leu Asn He He Thr Ser Ala Gin Ser Asn Asp He Tyr 
260 265 270 

45 ACT AAT CTT CTA GCT GAT TAT AAA AAA ATA GCG TCT AAA CTT AGC AAA 864 

Thr Agu Leu Leu Ala Asp Tyr Lys Lys He Ala Ser Lys Leu Ser Lys 
275 280 285 



672 



720 



GTA CAA GTA TCT AAT CCA CTA CTT AAT CCT TAT AAA GAT GTT TTT GAA 912 
Val Gin Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu 
290 295 300 

GCA AAG TAT GGA TTA GAT AAA GAT GCT AGC GGA ATT TAT TCG GTA AAT 960 
Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly He Tyr Ser Val Asn 
305 310 315 320 

ATA AAC AAA TTT AAT GAT ATT TTT AAA AAA TTA TAC AGC TTT ACG GAA 1008 
He Asn Lys Phe Asn Asp He Phe Lys Lys Leu Tyr Ser Phe Thr Glu 

325 330 335 

TTT GAT TTA GCA ACT AAA TTT CAA GTT AAA TGT AGG CAA ACT TAT ATT 1056 
Phe Asp Leu Ala Thr Lys Phe Gin Val Lys Cys Arg Gin Thr Tyr He 
340 345 350 

GGA CAG TAT AAA TAC TTC AAA CTT TCA AAC TTG TTA AAT GAT TCT ATT 1X04 
Gly Gin Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser He 
355 360 365 

TAT AAT ATA TCA GAA GGC TAT AAT ATA AAT AAT TTA AAG GTA AAT TTT 1152 
Tyr Asn He Ser Glu Gly Tyr Asn He Asn Asn Leu Lys Val Asn Phe 
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370 375 380 

AGA GGA CAG AAT GCA AAT TTA AAT CCT AGA ATT ATT ACA CCA ATT ACA 1200 
Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg lie He Thr Pro He Thr 
385 390 395 400 

GGT AGA GGA CTA GTA AAA AAA ATC ATT AGA TTT TGT AAA AAT ATT GTT 1248 
Gly Arg Gly Leu Val Lys Lys He He Arg Phe Cys Lys Asn He Val 
405 410 415 

TCT GTA AAA GGC ATA AGG AAA TCA ATA TGT ATC GAA ATA AAT AAT GGT 1296 
Ser Val Lys Gly He Arg Lys Ser He Cys He Glu He Asn Asn Gly 
420 425 430 

GAG TTA TTT TTT GTG OCT TCC GAG AAT AGT TAT AAT GAT GAT AAT ATA 134 4 

Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn He 
435 440 445 

AAT ACT CCT AAA GAA ATT GAC GAT ACA GTA ACT TCA AAT AAT AAT TAT 1392 
Asn Thr Pro Lys Glu He Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr 
450 455 460 

GAA AAT GAT TTA GAT CAG GTT ATT TTA AAT TTT AAT AGT GAA TCA GCA 144 0 

Glu Asn Asp Leu Asp Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 
465 470 475 480 

CCT GGA CTT TCA GAT GAA AAA TTA AAT TTA ACT ATC CAA AAT GAT GCT 1488 
Pro Gly Leu Ser Asp Glu Lys Leu Aon Leu Thr He Gin Asn Asd Ala 
485 490 495 

TAT ATA CCA AAA TAT GAT TCT AAT GGA ACA AGT GAT ATA GAA CAA CAT 1536 
Tyr He Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin Has 
500 505 510 

35 GAT GTT AAT GAA CTT AAT GTA TTT TTC TAT TTA GAT GCA CAG AAA GTG 1584 

Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gin Lys Val 
515 520 525 



20 



30 



40 



50 



60 



CCC GAA GGT GAA AAT AAT GTC AAT CTC ACC TCT TCA ATT GAT ACA GCA 1632 
Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser He Asp Thr Ala 
530 535 540 



TTA TTA GAA CAA CCT AAA ATA TAT ACA TTT TTT TCA TCA GAA TTT ATT 1680 
Leu Leu Glu Gin Pro Lys He Tyr Thr Phe Phe Ser Ser Glu Phe He 
545 550 555 560 



AAT AAT GTC AAT AAA CCT GTG CAA GCA GCA TTA TTT GTA AGC TGG ATA 1728 
Asn Asn Val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp He 
565 570 575 

CAA CAA GTA TTA GTA GAT TTT ACT ACT GAA GCT AAC CAA AAA AGT ACT 1776 
Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 

580 585 590 

55 GTT GAT AAA ATT GCA GAT ATT TCT ATA GTT GTT CCA TAT ATA GGT CTT 1824 

Val Asp Lys He Ala Asp He Ser He Val Val Pro Tyr He Gly Leu 

595 600 605 



GCT TTA AAT ATA GGA AAT GAA GCA CAA AAA GGA AAT TTT AAA GAT GCA 18 72 

Ala Leu Asn He Gly Asn Glu Ala Gin Lys Gly Asn Phe Lvs Asp Ala 
610 615 620 



CTT GAA TTA TTA GGA GCA GGT ATT TTA TTA GAA TTT GAA CCC GAG CTT 192 0 

Leu Glu Leu Leu Gly Ala Gly He Leu Leu Glu Phe Glu Pro Glu Leu 
625 630 635 640 



70 



TTA ATT CCT ACA ATT TTA GTA TTC ACG ATA AAA TCT TTT TTA GGT TCA 1968 
Leu He Pro Thr He Leu Val Phe Thr He Lys Ser Phe Leu Gly Ser 
645 650 655 
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TCT GAT AAT AAA AAT AAA GTT ATT AAA GCA ATA AAT AAT GCA TTG AAA 2016 
Ser Asp Asn Lys Asn Lys Val He Lys Ala lie Asn Asn Ala Leu Lys 
660 665 670 

GAA AGA GAT GAA AAA TGG AAA GAA GTA TAT AGT TTT ATA GTA TCG AAT 2064 
Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe He Val Ser Asn 
675 680 685 

TGG ATG ACT AAA ATT AAT ACA CAA TTT AAT AAA AGA AAA GAA CAA ATG 2112 
Trp Met Thr Lys lie Asn Thr Gin Phe Asn Lys Arg Lys Glu Gin Met 
690 695 700 

TAT CAA GCT TTA CAA AAT CAA GTA AAT GCA CTT AAA GCA ATA ATA GAA 2160 
Tyr Gin Ala Leu Gin Asn Gin Val Asn Ala Leu Lys Ala He He Glu 
''^^ 710 715 720 

TCT AAG TAT AAT AGT TAT ACT TTA GAA GAA AAA AAT GAG CTT ACA AAT 2208 
Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn 
725 730 735 

AAA TAT GAT ATT GAG CAA ATA GAA AAT GAA CTT AAT CAA AAG GTT TCT 2256 
Lys Tyr Asp He Glu Gin He Glu Asn Glu Leu Asn Gin Lys Val Ser 
740 745 750 

ATA GCA ATG AAT AAT ATA GAC AGG TTC TTA ACT GAA AGT TCT ATA TCT 2304 
He AJa Met Asn Asn He Asp Arg Phe Leu Thr Glu Ser Ser He Ser 
755 760 765 

TAT TTA ATG AAA TTA ATA AAT GAA GTA AAA ATT AAT AAA TTA AGA GAA 23 52 

Tyr Leu Met Lys Leu He Asn Glu Val Lys He Asn Lys Leu Arq Glu 
770 775 780 

TAT GAT GAA AAT GTT AAA ACG TAT TTA TTA GAT TAT ATT ATA AAA CAT 24 00 

Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asp Tyr He He Lys His 

790 795 800 

GGA TCA ATC TTG GGA GAG AGT CAG CAA GAA CTA AAT TCT ATG GTA ATT 244 8 

Gly Ser He Leu Gly Glu Ser Gin Gin Glu Leu Asn ser Met Val He 
80S 810 815 

GAT ACC CTA AAT AAT AGT ATT CCT TTT AAG CTT TCT TCT TAT ACA GAT 24 96 

Asp Thr Leu Asn Asn Ser He Pro Phe Lys Leu Ser Ser Tyr Thr Asp 
820 825 830 

GAT AAA ATT TTA ATT TCA TAT TTT AAT AAG TTC TTT AAG AGA ATT AAA ->54 4 

Asp Lys He Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arq He Lyc 
835 840 845 

AGT AGT TCT GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT 2592 
Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Aap 
850 855 860 

ACT TCA GGA TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA 264 0 

Thr Ser Gly Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lvs 
865 870 875 880 

TAT CCA ACT AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT 2688 
Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 
885 890 895 

GAA GTT AAT ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT 2736 
Glu Val Asn He Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lyc Tyr 
900 905 910 

AAA AAT TTT AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT 2784 
Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 
915 920 925 

AAG ATA GTA AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGG 28 3-> 

Lys He Val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Arg 
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GAT AAT AAT TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT 
Asp Asn Asn Ser Gly Trp Lys Val Set Leu Asn His Asn Glu He He 

950 955 ggo 

TGG ACA TTG CAA GAT AAT TCA GGA ATT AAT CAA AAA TTA GCA TTT AAC 
Trp Thr Leu Gin Asp Asn Ser Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 975 

TAT GGT AAC GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT 
Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp Ue Phe 
980 985 990 

GTA ACT ATA ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT 
Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
995 1000 1005 

GGA AAT TTA ATA GAT AAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT 
Gly Asn Leu He Asp Lys Lys Ser He Leu Asn Leu Gly Asn He His 
^010 1015 1020 

GTT AGT GAC AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA 
y^ir Val Asn Cys Ser Tyr Thr Arq 

1030 1035 1040 

l^l nl ?nl I'"'' GAT AAA GAA TTA GAT GAA 3168 

Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
1045 1050 1055 

ACA GAA ATT CAA ACT TTA TAT AAC AAT GAA CCT AAT GCA AAT ATT TTA 
Thr Glu He Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu 
1060 1065 1070 

AAG GAT TTT TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA 
Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tvr Leu 
1075 1080 1085 

TTA AAT GTG TTA AAA CCA AAT AAC TTT ATT AAT AGG AGA ACA GAT TCT 
Leu Asn Val Leu Lys Pro Asn Asn Phe He Asn Arg Arg Thr Asp Ser 
1090 1095 1100 

ACT TTA AGC ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA 
Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arq 
1105 1110 1115 ^^Iq 

TTA TAT AGT GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT 3408 
Leu T^/r Ser Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
1125 1130 1135 

ACT AAC GAT AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT 34 56 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 
1140 1145 1150 

GTA GCC AGC AAA ACT CAC TTA CTT CCA TTA TAT GCT GAT ACA GCT ACC 
Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr 
1155 1160 1165 

ACA AAT AAA GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT 
Thr Asn Lys Glu Lys Thr He Lys He Ser Ser Ser Gly Asn Arq Phe 
1170 - 1175 1180 

AAT CAA GTA GTA GTT ATG AAT TCA GTA GGA TGT ACA ATG AAT TTT AAA 
Asn Gin Val Val val Met Asn Ser Val Gly Cys Thr Met Asn Phe Lvs 

1190 1195 1200 
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AAT AAT AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA GAT ACT 3648 
Asn Asn Asn Gly Asn Asn lie Gly Leu Leu Glv Phe Lvs Ala Asp Thr 
1205 1210 ' 12X5 

?7J ?r '^^^ '^^'^ '^^'^ ^AT ATG AGA GAT AAT ACA AAC 3696 

Val val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn 
1220 1225 1230 

AGC AAT GGA TTT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA TGG CAA 3744 
Ser Asn Gly Phe Phe Trp Asn Phe lie Ser Glu Glu His Gly Trp Gin 
1235 1240 1245 



GAA AAA TAA 
Glu Lys 
1250 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 50 : 

Met Pro Thr He Asn Ser Phe Asn Tyr Asn Asp Pro Val Aon Asn Arg 
^5 10 15 

Thr He Leu Tyr He Lys Pro Gly Gly Cys Gin Gin Phe Tyr Lys Ser 
20 25 30 

Phe Asn He Met Lys Asn He Trp lie He Pro Glu Arg Asn Val He 
35 40 45 

Gly Thr He Pro Gin Asp Phe Leu Pro Pro Thr Ser Leu Lys Asn Gly 
50 55 60 

Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp Gin Glu Lys 
^5 70 75 80 

Asp Lys Phe Leu Lys He Val Thr Lys He Phe Asn Arg He Asn Asp 
85 90 95 

Asn Leu Ser Gly Arg He Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro 
100 105 110 



Tyr Leu Gly Asn Asp Asn Thr Pro Asp Gly Asp Phe He He Asn Asp 

115 120 125 

Ala Ser Ala Val Pro He Gin Phe Ser Asn Gly Ser Gin Ser He Leu 

130 135 140 



Leu Pro Asn Val He He Met Gly Ala Glu Pro Asp Leu Phe Glu Thr 
145 150 155 160 

Asn Ser Ser Asn He Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 
165 170 175 

Gly Phe Gly Ser He Ala He Val Thr Phe Ser Pro Glu Tyr Ser Phe 
180 185 190 

Arg Phe Lys Asp Asn Ser Met Asn Glu Phe He Gin Asp Pro Ala Leu 
195 200 205 

Thr Leu Met His Glu Leu He His Ser Leu His Gly Leu Tyr Gly Ala 
210 215 220 

Lys Gly He Thr Thr Lys Tyr Thr He Thr Gin Lys Gin Asn Pro Leu 
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225 230 235 240 

He Thr Asn He Arg Gly Thr Asn He Glu Glu Phe Leu Thr Phe Gly 
245 250 255 

Gly Thr Asp Leu Asn He He Thr Ser Ala Gin Ser Asn Asp He Tyr 
260 265 270 

Thr Asn Leu Leu Ala Asp Tyr Lys Lys He Ala Ser Lys Leu Ser Lys 
275 280 285 

Val Gin Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Ehe Glu 
290 295 300 

Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly He Tyr Ser Val Asn 

310 315 

He Asn Lys Phe Asn Asp He Phe Lys Lys Leu Tyr Ser Phe Thr Glu 
325 330 335 

Phe Asp Leu Ala Thr Lys Phe Gin Val Lys Cys Arg Gin Thr Tyr He 
340 345 350 

Gly Gin Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser He 
^55 360 365 

Tyr Asn He Ser Glu Gly Tyr Asn He Asn Asn Leu Lys Val Asn Phe 
370 375 380 

Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg He He Thr Pro He Thr 
385 390 395 400 

Gly Arg Gly Leu Val Lys Lys He He Arg Phe Cys Lys Asn He Val 
405 410 415 

Ser val Lys Gly He Arg Lys Ser He Cys He Glu He Asn Asn Gly 
420 425 430 

Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn He 
435 440 445 

Asn Thr Pro Lyo Glu He Asp Asp Thr Val Thr Ser Asn Asn Asn Tvr 
450 455 460 

Glu Asn Asp Leu Asp Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 
465 470 475 480 

Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr He Gin Asn Asp Ala 
485 490 495 

Tyr He Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin Hio 
500 505 510 

Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gin Lys Val 
515 520 525 

Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser He Asp Thr Ala 
530 535 540 

Leu Leu Glu Gin Pro Lys He Tyr Thr Phe Phe Ser Ser Glu Phe He 
545 550 555 560 

Asn Asn Val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp He 
565 570 575 

Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 
580 585 590 

Val Asp Lys He Ala Asp He Ser He Val Val Pro Tyr He Gly Leu 
595 600 605 
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Ala Leu Asn He Gly Asn Glu Ala Gin Lys Gly Asn Phe Lys Asp Ala 

615 620 

Leu Glu Leu Leu Gly Ala Gly He Leu Leu Glu Phe Glu Pro Glu Leu 
630 635 640 

Leu He Pro Thr He Leu Val Phe Thr He Lys Ser Phe Leu Gly Ser 
^"^S 650 655 

ser Asp Asn Lys Asn Lys Val He Lys Ala He Asn Asn Ala Leu Lys 
660 665 670 

Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe He Val Ser Asn 
^"^5 680 685 

Trp Met Thr Lys He Asn Thr Gin Phe Asn Lys Arg Lys Glu Gin Met 
690 695 700 

Tyr Gin Ala Leu Gin Asn Gin Val Asn Ala Leu Lys Ala He He Glu 

710 715 720 

Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn 
725 730 735 

Lys Tyr Asp He Glu Gin He Glu Asn Glu Leu Asn Gin Lys Val Ser 
740 745 750 

He Ala Met Asn Asn He Asp Arg Phe Leu Thr Glu Ser Ser He Ser 
755 760 765 

Tyr Leu Met Lys Leu He Asn Glu Val Lys He Asn Lys Leu Arg Glu 
770 775 780 

Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asp Tyr He He Lys His 

790 795 800 

Gly Ser He Leu Gly Glu Ser Gin Gin Glu Leu Asn Ser Met Val He 
805 810 815 

Asp Thr Leu Asn Asn Ser He Pro Phe Lys Leu Ser Ser Tvr Thr Asp 
820 825 830 

Asp Lyo He Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys 
835 840 845 

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tvr Val Asp 
850 855 860 

Thr Ser Gly Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys 

870 875 880 

Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 
885 890 895 

Glu Val Asn He Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lys Tyr 
900 905 910 

Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 

915 920 925 

Lys He Val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Ara 
930 935 940 

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 
945 950 955 960 

Trp Thr Leu Gin Asp Asn Ser Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 975 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe 
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980 985 990 

Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
995 1000 1005 

Gly Asn Leu He Asp Lys Lys Ser He Leu Asn Leu Gly Asn He His 
1010 1015 1020 

Val Ser Asp Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Arg 
1025 1030 1035 1040 

Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
1045 1050 1055 

Thr Glu He Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu 
1060 1065 1070 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
1075 1080 1085 

Leu Asn val Leu Lys Pro Asn Asn Phe He Asn Arg Arg Thr Asp Ser 
1090 1095 1100 

Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg 
1105 1110 1115 1120 

Leu Tyr Ser Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
1125 1130 1135 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 
1140 1145 1150 

val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr 
1155 1160 1165 

Thr Asn Lys Glu Lyo Thr He Lys He Ser Ser Ser Gly Asn Arg Phe 
1170 1175 1180 

Asn Gin Val Val Val Met Asn Ser Val Gly Cys Thr Met Asn Phe Lys 
1185 1190 1195 1200 

Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala Asp Thr 
1205 1210 1215 

Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn 
1220 1225 1230 

Ser Asn Gly Phe Phe Trp Asn Phe He Ser Glu Glu His Gly Trp Gin 
1235 1240 1245 

Glu Lys 
1250 

(2) INFORMATION FOR SEO ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3756 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
ATG CCA AAA ATT AAT AGT TTT AAT TAT AAT GAT CCT GTT AAT GAT AGA 
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Met Pro Lys He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp Arg 
5 10 15 

ACA ATT TTA TAT ATT AAA CCA GGC GGT TGT CAA GAA TTT TAT AAA TCA 

Thr He Leu Tyr He Lyo Pro Gly Gly Cys Gin Glu Phe Tyr Lys Ser 

20 25 So 



TTT AAT ATT ATG AAA AAT ATT TGG ATA ATT CCA GAG AGA AAT GTA ATT 
Phe Asn lie Met Lys Asn He Trp He He Pro Glu Arg Asn Val He 
35 40 45 



96 



GGT ACA ACC CCC CAA GAT TTT CAT CCG CCT ACT TCA TTA AAA AAT GGA 
Gly Thr Thr Pro Gin Asp Phe His Pro Pro Thr Ser Leu Lys Asn Glv 
^0 55 60 ^ 

GAT AGT AGT TAT TAT GAC CCT AAT TAT TTA CAA AGT GAT GAA GAA AAG 
Asp Ser ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp Glu Glu Lys 
" 75 80 

GAT AGA TTT TTA AAA ATA GTC ACA AAA ATA TTT AAT AGA ATA AAT AAT 
Asp Arg Phe Leu Lys He Val Thr Lys He Phe Asn Arg He Asn Asn 
85 90 95 

^ AAT CTT TCA GGA GGG ATT TTA TTA GAA GAA CTC TCA AAA GCT AAT CCA 

Asn Leu Ser Giy Giy He Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro 
100 105 110 

TAT TTA GGG AAT GAT AAT ACT CCA GAT AAT CAA TTC CAT ATT GGT GAT 
Tyr Leu Gly Asn Asp Aon Thr Pro Asp Asn Gin Phe His He Gly Asp 

120 125 

OCA TCA GCA GTT GAG ATT AAA TTC TCA AAT GGT AGC CAA GAC ATA CTA 
Ala Ser Ala Val Glu He Lys Phe Ser Asn Gly Ser Gin Asp He Leu 
130 135 

TTA CCT AAT GTT ATT ATA ATG GGA GCA GAG CCT GAT TTA TTT GAA ACT 
Leu Pro Asn Vai He He Met Gly Ala Glu Pro Aso Leu Phe Glu Thr 

150 155 ■ 160 

AAC AGT TCC AAT ATT TCT CTA AGA AAT AAT TAT ATG CCA AGC AAT CAC 
Asn Ser Ser Asn He Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 
165 170 175 

GGT TTT GGA TCA ATA GCT ATA GTA ACA TTC TCA CCT GAA TAT TCT TTT 576 
•♦.^ Gly Phe Gly Ser He Ala He Val Thr Phe Ser Pro Glu Tyr Ser »>he 
180 185 190 

AGA TTT AAT GAT AAT AGT ATG AAT GAA TTT ATT CAA GAT CCT GCT CTT 624 
Arg Phe Asn Asp Asn Ser Met Asn Glu Phe He Gin Asp Pro Ala Leu 
195 200 205 

ACA TTA ATG CAT GAA TTA ATA CAT TCA TTA CAT GGA CTA TAT GGG GCT 672 
Thr Leu Met His Glu Leu He His Ser Leu His Gly Leu Tyr Gly Ala 
210 215 220 
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AAA GGG ATT ACT ACA AAG TAT ACT ATA ACA CAA AAA CAA AAT CCC CTA 
Lys Gly He Thr Thr Lys Tyr Thr He Thr Gin Lys Gin Asn Pro Leu 
225 230 235 240 

ATA ACA AAT ATA AGA GGT ACA AAT ATT GAA GAA TTC TTA ACT TTT GGA 
He Thr Asn He Arg Gly Thr Asn He Glu Glu Phe Leu Thr Phe Gly 
245 250 255 

GGT ACT GAT TTA AAC ATT ATT ACT AGT GCT CAG TCC AAT GAT ATC TAT 816 
Gly Thr Asp Leu Asn He He Thr Ser Ala Gin Ser Asn Asp He Tyr 
260 265 270 

ACT AAT CTT CTA GCT GAT TAT AAA AAA ATA GCG TCT AAA CTT AGC AAA 864 
Thr Asn Leu Leu Ala Asp Tyr Lys Lys He Ala Ser Lvs Leu Ser Lys 
275 280 285 
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^'^'^ ^'^ TAT AAA GAT GTT TTT GAA 

?Qn '^^^ ^^^'^ ^^"^ "^y^ Lys Asp Val Phe Glu 

295 300 

GCA AAG TAT GGA TTA GAT AAA GAT GCT AGC GGA ATT TAT TCG GTA AAT 
Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly He Tyr Ser Val Asn 

315 320 

ATA AAC AAA- TTT AAT GAT ATT TTT AAA AAA TTA TAG AGC TTT ACG GAA 
He Asn Lys Phe Asn Asp He Phe Lys Lys Leu Tyr Ser Phe Thr Glu 

330 335 

TTT GAT TTA GCA ACT AAA TTT CAA GTT AAA TGT AGG CAA ACT TAT ATT 
Phe Asp Leu Ala Thr Lys Phe Gin Val Lys Cys Arg Gin Thr Tyr He 
340 345 350 

S?^ ^ I^^ ^ ™^ '^^ ^ TCA AAC TTG TTA AAT GAT TCT ATT 

Gly Gin Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser lie 
355 360 365 

TAT AAT ATA TCA GAA GGC TAT AAT ATA AAT AAT TTA AAG GTA AAT TTT 
Tyr Asn He Ser Glu Gly Tyr Asn He Asn Asn Leu Lvs Val Asn Phe 

•>'0 375 

AGA GGA CAG AAT GCA AAT TTA AAT CCT AGA ATT ATT ACA CCA ATT ACA 
Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg He He Thr Pro He Thr 

390 395 

GGT AGA GGA CTA GTA AAA AAA ATC ATT AGA TTT TGT AAA AAT ATT GTT 
Gly Arg Gly Leu Val Lys Lys He He Arg Phe Cys Lys Asn He Val 
405 410 4i5 

TCT GTA AAA GGC ATA AGG AAA TCA ATA TGT ATC GAA ATA AAT AAT GGT 
ser Val Lys Gly He Arg Lys Ser He Cys He Glu He Asn Asn Gly 
420 425 430 

GAG TTA TTT TTT GTG GCT TCC GAG AAT AGT TAT AAT GAT GAT AAT ATA 
Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn He 
435 440 

AAT ACT CCT AAA GAA ATT GAC GAT ACA GTA ACT TCA AAT AAT AAT TAT 
Asn Thr Pro Lys Glu He Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr 

455 460 

JAA AAT GAT TTA GAT CAG GTT ATT TTA AAT TTT AAT AGT GAA TCA GCA 
Glu Asn Asp Leu Asp Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 
'^^^ 470 475 

CCT GGA CTT TCA GAT GAA AAA TTA AAT TTA ACT ATC CAA AAT GAT GCT 
Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr He Gin Asn Asp Ala 
485 490 

TAT ATA CCA AAA TAT GAT TCT AAT GGA ACA AGT GAT ATA GAA CAA CAT 
Tyr He Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin His 
500 505 510 

GAT GTT AAT GAA CTT AAT GTA TTT TTC TAT TTA GAT GCA CAG AAA GTG 
Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gin Lvs Val 
515 520 525 

CCC GAA GGT GAA AAT AAT GTC AAT CTC ACC TCT TCA ATT GAT ACA GCA 
Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser He Asp Thr Ala 
530 535 54Q 



TTA TTA GAA CAA CCT AAA ATA TAT ACA TTT TTT TCA TCA GAA TTT ATT 
Leu Leu Glu Gin Pro Lys He Tyr Thr Phe Phe Ser Ser Glu Phe He 
545 550 555 550 

AAT AAT GTC AAT AAA CCT GTG CAA GCA GCA TTA TTT GTA AGC TGG ATA 
Asn Asn Val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp He 
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565 570 575 

CAA CAA GTG TTA GTA GAT TTT ACT ACT GAA GCT AAC CAA AAA AGT ACT 1776 
Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 
580 585 590 

GTT GAT AAA ATT GCA GAT ATT TCT ATA GTT GTT CCA TAT ATA GGT CTT 1824 
Val Asp Lys He Ala Asp He Ser He Val Val Pro Tyr He Gly Leu 
595 600 605 

GCT TTA AAT ATA GGA AAT GAA GCA CAA AAA GGA AAT TTT AAA GAT GCA 1872 
Ala Leu Asn He Gly Asn Glu Ala Gin Lys Gly Asn Phe Lys Asp Ala 
610 615 620 



L'^ CTT GAA TTA TTA GGA GCA GGT ATT TTA TTA GAA TTT GAA CCC GAG CTT 

Leu Glu Leu Leu Gly Ala Gly He Leu Leu Glu Phe Giu Pro Glu Leu 
625 630 635 640 



1920 



TTA ATT CCT ACA ATT TTA GTA TTC ACG ATA AAA TCT TTT TTA GGT TCA 1968 

Leu He Pro Thr He Leu Val Phe Thr He Lys Ser Phe Leu Gly Ser 
645 650 655 

TCT GAT AAT AAA AAT AAA GTT ATT AAA GCA ATA AAT AAT GCA TTG AAA 2016 

Ser Asp Asn Lys Asn Lys Val He Lys Ala He Asn Asn Ala Leu Lys 
660 665 670 

GAA AG A GAT GAA AAA TGG AAA GAA GTA TAT AGT TTT ATA GTA TCG AAT 2064 

Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe He Val Ser Asn 

675 680 685 

TGG ATG ACT AAA ATT AAT ACA CAA TTT AAT AAA AG A AAA GAA CAA ATG 2112 

Trp Met Thr Lys He Asn Thr Gin Phe Asn Lys Arg Lys Glu Gin Met 

690 695 700 



-i> TAT CAA GCT TTA CAA AAT CAA GTA AAT GCA ATT AAA ACA ATA ATA GAA 2160 

Tyr Gin Ala Leu Gin Asn Gin Val Asn Ala He Lys Thr He He Glu 
705 710 715 720 



TCT AAG TAT AAT AGT TAT ACT TTA GAG GAA AAA AAT GAG CTT ACA AAT 2208 

Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn 
725 730 735 

AAA TAT GAT ATT AAG CAA ATA GAA AAT GAA CTT AAT CAA AAG GTT TCT 22 56 

Lys Tyr Asp He Lys Gin He Glu Asn Glu Leu Asn Gin Lys Val Ser 
740 745 750 

ATA GCA ATG AAT AAT ATA GAC AGG TTC TTA ACT GAA AGT TCT ATA TCC 23 04 

He Ala Met Asn Asn He Asp Arg Phe Leu Thr Glu Ser Ser He Ser 

755 760 765 

TAT TTA ATG AAA TTA ATA AAT GAA GTA AAA ATT AAT AAA TTA AG A GAA 2 352 

Tyr Leu Met Lys Leu He Asn Glu Val Lys He Asn Lys Leu Arg Glu 
770 775 780 

55 TAT GAT GAG AAT GTC AAA ACG TAT TTA TTG AAT TAT ATT ATA CAA CAT 24 00 

Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asn Tyr He He Gin His 

785 790 795 800 

GGA TCA ATC TTG GGA GAG AGT CAG CAA GAA CTA AAT TCT ATG GTA ACT 24 48 

OO Gly Ser He Leu Gly Glu Ser Gin Gin Glu Leu Asn Ser Met Val Thr 

805 810 815 

GAT ACC CTA AAT AAT AGT ATT CCT TTT AAG CTT TCT TCT TAT ACA GAT 24 96 

Asp Thr Leu Asn Asn Ser He Pro Phe Lys Leu Ser Ser Tyr Thr Asp 
820 825 830 



70 



GAT AAA ATT TTA ATT TCA TAT TTT AAT AAA TTC TTT AAG AGA ATT AAA 2 54 4 

Asp Lys He Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys 
835 840 845 
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AGT AGT TCA GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT 2592 
Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 
850 855 860 

ACT TCA GGA TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA 2640 
Thr Ser Gly Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys 
865 870 875 880 

TAT CCA ACT AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT 2688 
Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 
885 890 895 

GAA GTT AAT ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT 2736 
Glu Val Asn He Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lys Tyr 
900 905 910 

AAA AAT TTT AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT 2784 
Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 
915 920 925 

AAG ATA GTA AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGA 28 32 

Lys He Val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Ara 
930 935 940 

GAT AAT AAT TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT •?880 
Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 

950 955 960 

TGG ACA TTG CAA GAT AAT GCA GGA ATT AAT CAA AAA TTA GCA TTT AAC '>928 
Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 975 

TAT GGT AAC GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT 2 976 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe 
•^^ 980 985 990 
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GTA ACT ATA ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT 3024 
Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
995 1000 1005 

GGA AAT TTA ATA GAT CAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT 3072 
Gly Asn Leu He Asp Gin Lys Ser He Leu Asn Leu Gly Asn He His 
1010 1015 1020 



GTT AGT GAC AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA 3120 
Val Ser Asp Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Arq 
025 1030 1035 1040 



1 



TAT ATT GGT ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA 3168 
Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
1045 1050 1055 

ACA GAA ATT CAA ACT TTA TAT AGC AAT GAA CCT AAT ACA AAT ATT TTG 3216 
Thr Glu He Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn He Leu 
1060 1065 1070 

AAG GAT TTT TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA 3264 
Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tvr Tyr Leu 
1075 1080 1085 ' 

TTA AAT GTG TTA AAA CCA AAT AAC TTT ATT GAT AGG AGA AAA GAT TCT 3 312 

Leu Asn Val Leu Lys Pro Asn Asn Phe He Asp Arg Arg Lyc Asp Ser 
1090 1095 1100 

ACT TTA AGC ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA 3 36 0 

Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Ara 
1105 1110 1115 1120 

TTA TAT AGT GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT 34 08 

Leu Tyr Ser Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
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(2) INFORMATION FOR SEQ ID NO: 52: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1252 amino acidS 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Met Pro Lys He Asn Ser Phe Asn Tyr Asn Asp Pro Vai Aon Asp Arg 

1 5 10 — 



15 



Thr He Leu Tyr He Lys Pro Gly Gly Cys Gin Glu Phe Tyr Lys Ser 
20 25 30 

Phe Asn He Met Lys Asn He Trp He He Pro Glu Arg Asn Val He 
35 40 45 

Gly Thr Thr Pro Gin Asp Phe His Pro Pro Thr Ser Leu Lys Asn Glv 
50 55 60 

Aop Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp Glu Glu Lys 
60 '° 80 

Asp Arg Phe Leu Lys He Val Thr Lys He Phe Asn Arg He Asn Asn 
85 90 95 



Asn Leu Ser Gly Gly He Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro 
100 105 110 

Tyr Leu Gly Asn Asp Asn Thr Pro Asp Asn Gin Phe His He Gly Asp 
115 120 125 

Ala Ser Ala Val Glu He Lys Phe Ser Asn Gly Ser Gin Asp He Leu 
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3504 
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Th3 ^ ^A'T GTA TAT ATT AAT TTT 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 
^140 1145 1150 

Sit ^ ^^'^ '^'^ TTA TAT GCT GAT ACA GCT ACC 

val Ala Ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr 

10 ^^^^ 1160 1165 

ACA AAT AAA GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT 
Thr Asn Lys Glu Lys Thr He Lys He Ser Ser Ser Gly ^n Arg l^e 

1175 1180 

'■^ ^'^^ ^"^^ GTA GGA AAT AAT TGT ACA ATG AAT 3600 

Asn Gin Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn ° 

1190 1195 ^200 

P^I ^'^ ^"^T '^'TG TTA GGT TTC AAG GCA 3648 

-1' Phe Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala 

1205 1210 1215 

GAT ACT GTA GTT GCT AGT ACT TGG TAT TAT ACA CAT ATG AGA GAT CAT 3696 
Asp Thr val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp uTs 
1220 1225 1230 

THr ^""'^ S?"" ir ""^^ ATT TCT GAA GAA CAT GGA 3744 

Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe He Ser Glu Glu His Glv 
1235 1240 1245 

TGG CAA GAA AAA TAA 

Trp Gin Glu Lys 3759 

1250 
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130 135 

Leu Pro Asn Val lie He Met Gly Ala Glu Pro Asp Leu Phe Glu Thr 
145 150 155 160 

Asn Ser Ser Asn He Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 
165 170 175 

Gly Phe Gly Ser He Ala He Val Thr Phe Ser Pro Glu Tyr Ser Phe 
180 185 190 

Arg Phe Asn Asp Asn Ser Met Asn Glu Phe He Gin Asp Pro AJa Leu 
195 200 205 

Thr Leu Met His Glu Leu He His Ser Leu His Gly Leu Tyr Gly Ala 
210 215 220 

Lys Gly He Thr Thr Lys Tyr Thr He Thr Gin Lys Gin Asn Pro Leu 
225 230 235 240 

He Thr Asn He Arg Gly Thr Asn He Glu Glu Phe Leu Thr Phe Gly 
245 250 255 

Gly Thr Asp Leu Asn He He Thr Ser Ala Gin Ser Asn Asp He Tyr 
260 265 270 

Thr Asn Leu Leu Ala Asp Tyr Lys Lys He Ala Ser Lvo Leu Ser Lys 
275 280 285 

Val Gin Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu 
290 295 300 

Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly He Tyr Ser Val Asn 
305 310 315 320 

He Asn Lys Phe Asn Asp He Phe Lys Lys Leu Tyr Ser Phe Thr Glu 
325 330 335 

Phe Asp Leu Ala Thr Lys Phe Gin Val Lys Cys Arg Gin Thr Tyr He 
340 345 350 

Gly Gin Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser He 
355 360 365 

Tyr Asn He Ser Glu Gly Tyr Asn He Asn Asn Leu Lys Val Asn Phe 
370 375 380 

Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg He He Thr Pro He Thr 
385 390 395 400 

Gly Arg Gly Leu Val Lys Lys He He Arg Phe Cys Lys Asn He Val 
405 410 415 

Ser Val Lys Gly He Arg Lys Ser He Cys He Glu He Asn Asn Gly 
420 425 430 

Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn He 
435 440 445 

Asn Thr Pro Lys Glu He Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr 
450 455 460 

Glu Asn Asp Leu Asp Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 
465 470 475 480 

Pro Gly Lou Ser Asp Glu Lys Leu Asn Leu Thr He Gin Asn Asp Ala 
485 490 495 

Tyr He Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin His 
500 505 510 
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-Asp val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gin Lys Val 

520 525 

Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser He Asp Thr Ala 

535 

Leu Leu Glu Gin Pro Lys lie Tyr Thr Phe Phe ser Ser Glu Phe He 

555 560 

Asn Asn val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp He 

570 

Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 
580 585 

Val Asp Lys lie Ala Asp He Ser He Val Val Pro Tyr He Gly Leu 

600 605 

Ala Leu Asn He Gly Asn Glu Ala Gin Lys Gly Asn Phe Lys Asp Ala 

615 620 

Leu Glu Leu Leu Gly Ala Gly He Leu Leu Glu Phe Glu Pro Glu Leu 

630 635 640 

Leu He Pro Thr He Leu Val Phe Thr He Lys Ser Phe Leu Gly Ser 

650 655 

Ser Asp Asn Lys Asn Lys Val He Lys Ala He Asn Asn Ala Leu Lvs 
660 665 670 

Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe He Val Ser Asn 
675 6B0 685 

Trp Met Thr Lys He Asn Thr Gin Phe Asn Lys Arg Lys Glu Gin Met 

695 700 

Tyr Gin Ala Leu Gin Asn Gin Val Asn Ala He Lys Thr He He Glu 

■'lO 715 720 

Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn 
725 730 735 

Lys Tyr Asp He Lys Gin He Glu Asn Glu Leu Asn Gin Lys Val Ser 
■'^O 745 750 

He Ala Met Asn Asn He Asp Arg Phe Leu Thr Glu Ser Ser He Ser 
■^55 760 765 

Tyr Leu Met Lys Leu He Asn Glu Val Lys He Asn Lys Leu Arg Glu 

775 780 

Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asn Tyr He He Gin His 

790 795 800 

Gly Ser He Leu Gly Glu Ser Gin Gin Glu Leu Asn Ser Met Val Thr 
805 810 815 

Asp Thr Leu Asn Asn Ser He Pro Phe Lys Leu Ser Ser Tyr Thr Asd 
820 825 830 

Asp Lys He Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys 
"35 840 845 

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 

850 855 860 

Thr Ser Gly Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tvr Lvs 
865 870 875 880 

Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 

- 322 - 



wo 98/08540 



PCT/US97/153M 



885 890 895 

Glu Val Asn lie Ser Gin Asn Asp Tyr He lie Tyr Asp Asn Lys Tyr 
900 905 910 

Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 
915 920 925 

Lys lie val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Arq 
^•*0 935 940 

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 

950 955 ^ 960 

Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe 
980 985 990 

Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
995 1000 1005 

fn?n^^" '^^P ^^"^ Leu Gly Asn He His 

^010 1015 1020 

Voic^^^ ^""^ ^^"^ '^ai Asn Cys Ser Tyr Thr Arq 

1030 1035 X040 

Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
1045 1050 1055 

Thr Glu He Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn He Leu 
1060 1065 1070 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
1075 1080 1085 

^^"^ ^^^^."^^^ ^^"^ ^^"^ Asp Arg Arg Lys Asp Ser 

1090 1095 1100 

Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg 

mo 1115 i^Iq 

Leu Tyr Ser Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
1125 1130 1135 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 
1140 1145 1150 

Val Ala Ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr 
1155 1160 1165 

Thr Asn Lys Glu Lys Thr He Lys He Ser Ser Ser Gly Asn Arg Phe 
ll'O 1175 ^j^QQ 

Asn Gin Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn 

1190 1195 1200 

Phe Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala 
1205 1210 1215 

Asp Thr Val val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His 
1220 1225 1230 

Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe He Ser Glu Glu His Glv 
1235 1240 1245 

Trp Gin Glu Lys 
1250 
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* (2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
ID) TOPOLOGY: linear 



MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 
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lix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 108.. 1460 

Ixi) SEQUENCE DESCRIPTION; SEQ ID NO:53: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164 
His His His His His His His His His Ser Ser Gly His lie Glu Gly 
5 10 15 

5^*^ S*^? f'J^ T^*^ "^^"^ GAT AAA ATT 212 

30 35 



wrtA U«l Mli 

Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp Asp Lys lie 

20 25 *^ ' _ _ 



60 
116 



TTA ATT TCA TAT TTT AAT AAG TTC TTT AAG AGA ATT AAA AGT AGT TCT 260 
Leu lie Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys Ser Ser Ser 
40 45 50 

GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT ACT TCA GGA 308 
Val Leu Asn Met Arg Tyr Lye Asn Asp Lys Tyr val Asp Thr Ser Gly 
55 60 65 

TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA TAT CCA ACT 356 
Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys Tyr Pro Thr 
70 75 80 

AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT GAA GTT AAT 4 04 

^> Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser Glu VaJ Asn 
85 90 95 

ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT AAA AAT TTT 4 52 

He Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lys Tyr Lys Asn Phe 
JOO 105 110 115 

AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT AAG ATA GTA 500 
Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn Lys He Val 
120 125 130 

AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGG GAT AAT AAT 54 8 

Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Arg Asp Asn Asn 
135 140 145 

TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT TGG ACA TTG 596 
Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He Trp Thr Leu 
150 155 160 

CAA GAT AAT TCA GGA ATT AAT CAA AAA TTA GCA TTT AAC TAT GGT AAC 64 4 

Gin Asp Asn Ser Gly He Asn Gin Lys Leu Ala Phe Asn Tyr Gly Asn 
165 170 175 

GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT GTA ACT ATA 6 92 

Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe Val Thr He 
n) 180 185 190 195 
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ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT GGA AAT TTA 74 0 

Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr lie Asn Gly Asn Leu 
200 205 210 

ATA GAT AAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT GTT AGT GAG 788 
He Asp Lys Lys Ser He Leu Asn Leu Gly Asn He His Val Ser Asp 
215 220 225 

AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA TAT ATT GGT 836 
Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Arg Tyr He Gly 
230 235 240 

ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA ACA GAA ATT 884 
He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu Thr Glu He 
245 250 255 

CAA ACT TTA TAT AAC AAT GAA CCT AAT GCA AAT ATT TTA AAG GAT TTT 
Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu Lys Asp Phe 
260 265 270 275 

TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA TTA AAT GTG 
Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu Leu Asn Val 
280 285 290 

TTA AAA CCA AAT AAC TTT ATT AAT AGG AGA ACA GAT TCT ACT TTA AGC 
Leu Lys Pro Asn Asn Phe He Asn Arg Arg Thr Aap Ser Thr Leu Ser 
295 300 305 

ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA TTA TAT AGT 1076 
He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg Leu Tyr Ser 
310 315 320 

GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT ACT AAC GAT 1124 
Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser Thr Asn Asp 

325 330 335 

AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT GTA GCC AGC 1172 
Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe Val Ala Ser 

345 350 355 

AAA ACT CAC TTA CTT CCA TTA TAT GCT GAT ACA GCT ACC ACA AAT AAA 1220 
Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr Thr Asn Lys 

360 365 370 

GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT AAT CAA GTA 1268 
Glu Lys Thr He Lys He Ser Ser Ser Gly Asn Arg Phe Asn Gin Val 
375 380 385 

GTA GTT ATG AAT TCA GTA GGA AAT TGT ACA ATG AAT TTT AAA AAT AAT 1316 
Val Val Met Asn Ser Val Gly Asn Cys Thr Met Asn Phe Lys Asn Asn 
390 395 400 

AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA GAT ACT GTA GTT 
Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala Asp Thr Val Val 
405 410 415 

GCT AGT ACT TGG TAT TAT ACA CAT ATG AGA GAT AAT ACA AAC AGC AAT 
Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn Ser Asn 
420 425 430 435 

GGA TTT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA TGG CAA GAA AAA 
Gly Phe Phe Trp Asn Phe He Ser Glu Glu His Gly Trp Gin Glu Lys 
440 445 450 

TAA 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 51 amino acids 



1364 



1412 



1460 



1463 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
^ 5 10 ^ 



15 



He Glu Gly Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr- Thr Asd 
20 25 30 

Asp Lys lie Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lvs 
35 40 45 

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 

Thr Ser Gly Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys 

70 75 80 

Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 
85 90 95 

Glu Val Asn He Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lvs Tyr 
100 105 110 

Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 
115 120 125 

Lys He Val Asn Val Asn Asn Glu Tyr Thr He He Asn Cvs Met Arg 

135 140 

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 

150 155 

Trp Thr Leu Gin Asp Asn Ser Gly He Asn Gin Lys Leu Ala Phe Asn 
165 170 175 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe 
180 185 190 

Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 

200 205 

Gly Asn Leu He Asp Lys Lys Ser He Leu Asn Leu Gly Asn He His 
"10 215 220 

Val Ser Asp Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Ara 

230 235 240 

Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
245 250 255 

Thr Glu He Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu 
260 265 270 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
275 280 285 

Leu Asn Val Leu Lys Pro Asn Asn Phe He Asn Arg Arg Thr Asp Ser 

-90 295 300 

Thr Leu Ser He Asn Asn He Arg ser Thr He Leu Leu Ala Asn Arq 

315 320 

Leu Tyr Ser Gly He Lys Val Lys He Gin Arg Val Aon Asn Ser Ser 
325 330 335 
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Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 
340 345 350 

Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr 

355 360 365 

Thr Asn Lys Glu Lys Thr He Lys lie Ser Ser Ser Gly Asn Arq Phe 
370 375 380 

Asn Gin Val Val Val Met Asn Ser Val G]y Asn Cys Thr Met Asn Phe 
385 390 395 400 

Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala Asp 
405 410 415 

Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr 
420 425 430 

Asn Ser Asn Gly Phe Phe Trp Asn Phe lie Ser Glu Glu His Gly Trp 



10 



15 



35 



40 



60 



65 



440 445 



Gin Glu Lys 
450 



25 I?.) INFORMATION FOR SEQ ID NO: 55: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc « "DNA" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 108.. 1463 



{XI) SEQUENCE DESCRIPTION: SEQ ID NO:55: 
AC;ATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC cat 116 

Met Gly His 
1 

cat cat cat cat cat cat cat cat CAC AGC AGC GGC CAT ATC GAA GGT 164 
His His His His His His His His His Ser Ser Glv His He Glu Glv 
5 10 15 * 

CGT CAT ATG GCT AGC ATG GCT CTT TCT TCT TAT ACA GAT GAT AAA ATT 212 
Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp Asp Lys He 
20 25 30 35 

TTA ATT TCA TAT TTT AAT AAA TTC TTT AAG AGA ATT AAA ACT AGT TCA 260 
Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys Ser Ser Ser 
40 45 50 

GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT ACT TCA GGA 308 
Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp Thr Ser G^y 
55 60 65 

TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA TAT CCA ACT 356 
Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys Tyr Pro Thr 
70 75 80 

AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT GAA GTT AAT 4 04 

Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Sor Glu Val Asn 
85 90 95 
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lie IZ n^*^ '^'^'^ '^^'^ AAA TAT AAA AAT TTT 

lie ser Gin Asn Asp Tyr He He Tyr Asp Asn Lys Tyr Lys Asn Phe 
105 115 

ser H I f ir ^^"^ TAT GAT AAT AAG ATA GTA 

ser lie Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn Lys He Val 
120 125 130 

AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGA GAT AAT AAT 
Asn val A3n Asn Glu Tyr Thr He He Asn Cys Met Arg P^p ^n 
l-*-> 140 145 

TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT TGG ACA TTG 
ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He lie Trp tS IIS 

155 160 

CAA GAT AAT GCA GGA ATT AAT CAA AAA TTA GCA TTT AAC TAT GGT AAC 
Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn Tyr Gil ^n 
" 170 

Al^ r7 I"^"" ^'"'^ ""^^ ATT TTT GTA ACT ATA 

Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe Val Thr lie 

190 

ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT GGA AAT TTA 
Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn Gly Asn Leu 
200 205 210 

ATA GAT CAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT GTT AGT GAC 
He Asp Gin Lys Ser He Leu Asn Leu Gly Asn He His Val Ser Asd 
215 220 225 

AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA TAT ATT GGT 
Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Arg Tvr He Gly 
230 235 240 ' 

ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA ACA GAA ATT 
He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu Thr Glu He 
245 250 255 

CAA ACT TTA TAT AGC AAT GAA CCT AAT ACA AAT ATT TTG AAG GAT TTT 
Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn He Leu Lys Asp Phe 
-60 265 270 275 

TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA TTA AAT GTG 
Trp Gxy Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu Leu Asn Val 
280 285 290 

TTA AAA CCA AAT AAC TTT ATT GAT AGG AGA AAA GAT TCT ACT TTA AGC 
Leu Lys Pro Asn Asn Phe He Asp Arg Arg Lys Asp Ser Thr Leu Ser 
295 300 305 

ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA TTA TAT AGT 
He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg Leu Tyr Ser 

315 320 

GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT ACT AAC GAT 
Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser Thr Asn Asp 

325 330 335 

AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT GTA GCC AGC 117 2 

Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe Val Ala Ser 

340 345 

AAA ACT CAC TTA TTT CCA TTA TAT GCT GAT ACA GCT ACC ACA AAT AAA 

Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr Thr Asn Lvs 
360 365 370 

GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT AAT CAA GTA 
Glu Lys Thr He Lys He Ser Ser ser Gly Asn Arg Phe Asn Gin Val 



452 



500 



548 



596 



644 



692 



740 



788 



836 



884 



93: 



980 



1028 



1076 



1124 



1220 



1268 
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60 



Thr ser Gly Tyr Asp Ser Asn lie Asn He Asn Gly Asp Val Tyr Lys 

^ 75 80 

Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 
85 90 95 

Glu val Asn lie Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lys Tyr 

105 ^ 

Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 

120 125 

Lys He val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Arg 
•^J" 135 140 

Asp Asn Asn ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 

ISO 155 

Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 

170 j^75 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe 
180 185 190 

Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 



1316 



1364 



1460 



^'^^ 380 385 

vlt vJT t'^? ^'^^ AAT TGT ACA ATG AAT TTT AAA AAT 

val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn Phi ^ 

^95 400 

i^l oly tTI r° ACT GTA 

Asn Gly Asn Asn He Gly Leu Lou Gly Phe Lys Ala Asp Thr Val 

410 415 

-II III ^tl T^. l'''^ ^'^T '^TG AGA GAT CAT ACA AAC AGC 1412 

-al Ala ser Thr Trp Tyr Tyr Thr His Met Arg Asp His Thr Asn Ser 
425 435 

AAT GGA TGT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA TCr raa rna 
Asn Gly cys Phe Trp Asn Phe He Ser gJS gJ? S?I ??y 

445 

AAA TAAAAGCTT 

Lys 1472 
(2) INFORMATION FOR SEO ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 52 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Giy His Hir, His His His His His His His His Ser Ser Gly His 
5 10 15 

lie Glu Gly Arg Hia Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp 

20 25 30 

Asp Lys lie Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lys 
35 40 45 

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 
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10 



15 



Gly Asn Leu lie Asp Gin Lys Ser He Leu Asn Leu Gly Asn He His 
210 215 220 

Val Ser Asp Asn He Leu Phe Lys He Val Asn Cya Ser Tyr Thr Arg 
225 230 235 240 

Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
245 250 255 

Thr Glu He Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn He Leu 
260 265 270 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
275 280 285 

Leu Asn Val Leu Lys Pro Asn Asn Phe He Asp Arg Arg Lvc Asp Ser 
290 295 300 

Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Aon Arg 
-(^ 305 310 315 320 

Leu Tyr Ser Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
325 330 335 

2.^ Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 
340 345 350 

Val Ala Ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr 
... 355 360 365 

.>() 

Thr Asn Lys Glu Lys Thr He Lys He Ser Ser Ser Gly Asn Arg Phe 
370 375 380 

Asn Gin Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn 
•^•^ 385 390 395 400 



40 



45 



50 



Phe Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala 
405 410 415 

Asp Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His 
420 425 430 

Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe He Ser Glu Glu His Gly 
435 440 445 

Trp Gin Glu Lys 
4 50 

(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CGCCATGGCT CTTTCTTCTT ATACAGATGA T 31 

(2) INFORMATION FOR SEQ ID NO: 58: 

65 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
70 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

. <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GCAAGCTTTT ATTTTTCTTG CCATCCATG 

(2) INFORMATION FOR SEQ ID NO: 59: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3876 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



15 



25 



30 



3.^ 



40 



50 



60 



65 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..3873 

SEQUENCE DESCRIPTION: SEQ ID NO:59: 



ATG CCA ATA ACA ATT AAC AAC TTT AAT TAT TCA GAT CCT GTT GAT AAT 4 8 

Met Pro He Thr He Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn 
^ ^ 10 15 



AAA AAT ATT TTA TAT TTA GAT ACT CAT TTA AAT ACA CTA GCT AAT GAG 
Lys Aan He Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu 
20 25 30 



96 



CCT GAA AAA GCC TTT CGC ATT ACA GGA AAT ATA TGG GTA ATA CCT GAT 144 
Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp Val He Pro Asp 
35 40 45 

AGA TTT TCA AGA AAT TCT AAT CCA AAT TTA AAT AAA CCT CCT CGA GTT 192 
Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arq Val 
50 55 60 

ACA AGC CCT AAA ACT GGT TAT TAT GAT CCT AAT TAT TTG ACT ACT GAT 240 
Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp 
65 70 75 80 

TCT GAC AAA GAT ACA TTT TTA AAA GAA ATT ATA AAG TTA TTT AAA AGA 288 
Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lvs Leu Phe Lys Arg 
85 90 * 95 



ATT AAT TCT AGA GAA ATA GGA GAA GAA TTA ATA TAT AGA CTT TCG ACA 336 
He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr Arg Leu Ser Thr 
100 105 HO 

GAT ATA CCC TTT CCT GGG AAT AAC AAT ACT CCA ATT AAT ACT TTT GAT 384 
Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He Asn Thr Phe Asp 
115 120 125 

TTT GAT GTA GAT TTT AAC AGT GTT GAT GTT AAA ACT AGA CAA GGT AAC 4 32 

Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arq Gin Gly Asn 
130 135 140 

AAC TGG GTT AAA ACT GGT AGC ATA AAT CCT AGT GTT ATA ATA ACT GGA 4 80 

Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val He He Thr Gly 

150 155 160 

CCT AGA GAA AAC ATT ATA GAT CCA GAA ACT TCT ACG TTT I\AA TTA ACT 5-^8 
Pro Arg Glu Asn He He Asp Pro Glu Thr Ser Thr phe Lys Leu Thr 
165 170 175 
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AAC AAT ACT TTT GCG GCA CAA GAA GGA TTT GGT GCT TTA TCA ATA ATT 
Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala Leu Ser lie lie 

185 

TCA ATA TCA CCT AGA TTT ATG CTA ACA TAT AGT AAT GCA ACT AAT GAT 
ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp 
195 200 205 

GTA GGA GAG GGT AGA TTT TCT AAG TCT GAA TTT TGC ATG GAT CCA ATA 
Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro He 

215 220 

CTA ATT TTA ATG CAT GAA CTT AAT CAT GCA ATG CAT AAT TTA TAT GGA 
Leu He Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly 
230 235 240 

ATA GCT ATA CCA AAT GAT CAA ACA ATT TCA TCT GTA ACT AGT AAT ATT 
lie Ala He Pro Asn Asp Gin Thr He Ser Ser Val Thr Ser Asn He 
245 250 255 

TTT TAT TCT CAA TAT AAT GTG AAA TTA GAG TAT GCA GAA ATA TAT GCA 
Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala Glu He Tyr Ala 
260 265 270 

TTT GGA GGT CCA ACT ATA GAC CTT ATT CCT AAA AGT GCA AGG AAA TAT 
Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser Ala Arg Lys Tyr 

280 285 

TTT GAG GAA AAG GCA TTG GAT TAT TAT AGA TCT ATA GCT AAA AGA CTT 
Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He Ala Lys Arg Leu 
290 295 300 

A.^T AGT ATA ACT ACT GCA AAT CCT TCA AGC TTT AAT AAA TAT ATA GGG 
Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr He Gly 

^10 315 320 

GAA TAT AAA CAG AAA CTT ATT AGA AAG TAT AGA TTC GTA GTA GAA TCT 
Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe Val Val Glu Ser 
325 330 335 

TCA GGT GAA GTT ACA GTA AAT CGT AAT AAG TTT GTT GAG TTA TAT AAT 
Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn 
340 345 350 

GAA CTT ACA CAA ATA TTT ACA GAA TTT AAC TAC GCT AAA ATA TAT AAT 1104 
Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala Lvs lie Tyr Asn 

355 360 365 

GTA CAA AAT AGG AAA ATA TAT CTT TCA AAT GTA TAT ACT CCG GTT ACG 1152 
Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr 
370 375 380 

GCG AAT ATA TTA GAC GAT AAT GTT TAT GAT ATA CAA AAT GGA TTT AAT 1200 
Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin Asn Gly Phe Asn 

390 395 

ATA CCT AAA AGT AAT TTA AAT GTA CTA TTT ATG GGT CAA AAT TTA TCT 
He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gin Aon Leu Ser 
405 410 4j^5 

CGA AAT CCA GCA TTA AGA AAA GTC AAT CCT GAA AAT ATG CTT TAT TTA 
Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu 
420 425 430 

TTT ACA AAA TTT TGT CAT AAA GCA ATA GAT GGT AGA TCA TTA TAT AAT 
Phe Thr Lys Phe Cys His Lys Ala He Asp Gly Arg Ser Leu Tyr Asn 
435 440 445 

AAA ACA TTA GAT TGT AGA GAG CTT TTA GTT AAA AAT ACT GAC TTA CCC 
Lys Thr Leu Asp Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1246 



1296 



1344 



139: 
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450 455 460 

TTT ATA GGT GAT ATT AGT GAT GTT AAA ACT GAT ATA TTT TTA AG A AAA 144 0 

Phe lie Gly Asp He Ser Asp Vai Lys Thr Asp He Phe Leu Arg Lys 
465 470 475 480 

GAT ATT AAT GAA GAA ACT GAA GTT ATA TAC TAT CCG GAC AAT GTT TCA 1488 
Asp He Asn Glu Glu Thr Glu Val He Tyr Tyr Pro Asp Asn Val Ser 
485 490 495 

GTA GAT CAA GTT ATT CTC AGT AAG AAT ACC TCA GAA CAT GGA CAA CTA 1536 
Val Asp Gin Vai He Leu Ser Lys Asn Thr Ser Glu His Gly GJ.n Leu 
500 505 510 

15 GAT TTA TTA TAC CCT AGT ATT GAC AGT GAG AGT GAA ATA TTA CCA GGG 1584 

Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu He Leu Pro Gly 
515 520 525 

GAG AAT CAA GTC TTT TAT GAT AAT AGA ACT CAA AAT GTT GAT TAT TTG 1632 
20 Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu 
530 535 540 

AAT TCT TAT TAT TAC CTA GAA TCT CAA AAA CTA AGT GAT AAT GTT GAA 1680 
Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu 
2:> 545 550 555 560 



30 



50 



60 



GAT TTT ACT TTT ACG AGA TCA ATT GAG GAG GCT TTG GAT AAT AGT GCA 1728 

Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala 

565 570 575 

AAA GTA TAT ACT TAC TTT CCT ACA CTA GCT AAT AAA GTA AAT GCG GGT 1776 

Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly 

580 585 590 



.O GTT CAA GGT GGT TTA TTT TTA ATG TGG GCA AAT GAT GTA GTT GAA GAT 1824 

Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp 

595 600 605 

TTT ACT ACA AAT ATT CTA AGA AAA GAT ACA TTA GAT AAA ATA TCA GAT 1872 

40 Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp Lys He Ser Asp 
610 615 620 

GTA TCA GCT ATT ATT CCC TAT ATA GGA CCC GCA TTA AAT ATA AGT AAT 1920 

Val Ser Ala He He Pro Tvr He Gly Pro Ala Leu Asn He Ser Asn 

625 630 635 640 



TCT GTA AGA AGA GGA AAT TTT ACT GAA GCA TTT GCA GTT ACT GGT GTA 1968 
Ser Vai Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Glv Val 
645 650 655 

ACT ATT TTA TTA GAA GCA TTT CCT GAA TTT ACA ATA CCT GCA CTT GGT 2016 
Thr He Leu Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly 

660 665 670 

55 GCA TTT GTG ATT TAT AGT AAG GTT CAA GAA AGA AAC GAG ATT ATT AAA 2064 

Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lys 

675 680 685 



ACT ATA GAT AAT TGT TTA GAA CAA AGG ATT AAG AGA TGG AAA GAT TCA 2112 
Thr He Asp Asn Cys Leu Glu Gin Arg He Lvs Arg Trp Lys Asp Ser 
690 ' 695 ' 700 



TAT GAA TGG ATG ATG GGA ACG TGG TTA TCC AGG ATT ATT ACT CAA TTT 216 0 

Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg He He Thr Gin Phe 
6> fOS 710 715 720 



70 



AAT AAT ATA AGT TAT CAA ATG TAT GAT TCT TTA AAT TAT CAG GCA GGT 2 208 

Asn Asn He Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly 
725 730 735 
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25 



•15 



60 



65 



lla ill ^ ^ XT'" ^ ^ TAT TCA GGA AGT 
Ala He Lys Ala Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser 
740 ' 



750 



aS ^ G^ jI" ^ ^^"^ ^T ■^'^ ^ AAT AGT TTA 

Asp Lys Glu Asn He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu 
755 760 765 



GAA TGT TCC GTA ACA TAT TTA TTT AAA AAT ATG TTA CCT AAA GTA ATT 
|. Glu Cyo ser Val Thr Tyr Leu Phe Lys Asn Met Leu p7o Hi iH 

795 800 

GAT GAA TTA AAT GAG TTT GAT CGA AAT ACT AAA GCA AAA TTA ATT AAT 
ASP Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu ill ^n 
20 815 

CTT ATA GAT AGT CAT AAT ATT ATT CTA GTT GOT GAA GTA GAT AAA TTA 
Leu lie ASP ser His Asn He He Leu Val Gly Glu Val lip ^ Uen 

825 830 

AAA GCA AAA GTA AAT AAT AGC TTT CAA AAT ACA ATA CCC TTT AAT ATT 
Lys Ala Lys Val Asn Asn Ser Phe Gin Asn Thr He Pro Phe ill 
835 840 845 



Gly Asp Val Gin Leu Asn Pro He Phe Pro Phe Asp Phe Lys Leu Gly 
900 905 sio 

AGT TCA GGG GAG GAT AGA GGT AAA GTT ATA GTA ACC CAG AAT GAA AAT 
Ser Ser Gly Glu Asp Arg Gly Lys Val He Val Thr Gin Asn Glu Asn 

915 920 



AGA ATA AAT AAA TGG GTA AGT AAT TTA CCT GGA TAT ACT ATA ATT GAT 
Arg He Asn Lys Trp Val Ser Aon Leu Pro Gly Tyr Thr He He Asp 

950 ggg 

AGT GTT AAA AAT AAC TCA GGT TGG AGT ATA GGT ATT ATT AGT AAT TTT 
Ser Val Lys Asn Asn Ser Gly Trp Ser He Gly He He Ser Asn Phe 
965 970 975 

Jl^, ir ^^"^ '^^ ^ ^'^ °AT AGT GAA CAA AGT ATA AAT 

Leu Val Phe Thr Leu Lys Gin Asn Glu Asp Ser Glu Gin Ser He Asn 
980 985 990 

in I™ ^7* l^^ GGA TAC AAT AAA TGG TTT 

Asn 1 
1005 

" -..w «-»*w ww\ rtrti rtio AAG All I'AT ATA 

Phe Val Thr Val Thr Asn Asn Met Met Gly Asn Met Lyc He Tyr He 



™. ^^j^-^. ^j^j^ j^j^ ^ 

Phe Ser Tyr Asp He Ser Asn Asn Ala Pro Gly Tyr Asn Lys Trp Phe 
99S 1000 



2256 



2304 



If) A^*^ w'^f ^ '^^^ ATG AAT AAT ATA AAT AAA TTT ATA CGA 

10 Asp val Lys He Ser Glu Ala Met Asn Asn He Asn Lys Phi lie ArJ 

775 780 



2400 



2448 



2496 



2544 



^0 Phe Ser Itl ThI I" 7™ ^"^^ ^'^ TAT 2592 

Phe Sei Tyr Thr Asn Asn Ser Leu Leu Lys Asp He He Asn Glu Tyr 

855 860 

Ph^ ^'^ ^'^ ^^'^ '^^^ ^ ^'^'^ 'TTG AGC CTA CAA AAC AGA AAA 

35 865 ""^^ 870 ^^"^ "^^^ ^^"^ IjI ""^^ ^ 

AAT ACT TTA GTG GAT ACA TCA GGA TAT AAT GCA GAA GTG AGT GAA GAA 

Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Ser Glu 
40 890 895 

GGC GAT GTT CAG CTT AAT CCA ATA TTT CCA TTT GAC TTT AAA TTA GGT 



2640 



2688 



2736 



2784 



Sn tT'^ l^'^ ^'^^ '^^'^ ^AA AGT TTT AGC ATT AGT TTT TGG ATT 2832 

He Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser He Ser Phe Trp lie 

^•30 935 



2880 



2928 



2976 



3024 



70 SI? SIT ^'l ^! tl? tl2 19^. ^1 ^7.? ATA 
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1010 1015 



1020 



35 



1165 



^"^^ '^^'^ ^^■^ AAT CAT AGT ACT GAA GAT ATA TAT OCT 

4^ Asn Giu Thr Met Tyr Ala Asp Asn His Ser Thr Glu AsJ ne lyl All 

^190 1195 1200 

ATA GGT TTA AGA GAA CAA ACA AAG GAT ATA AAT GAT AAT ATT ATA TTT 
He Gly Leu Arg Glu Gin Thr Lys Asp lie Asn AsJ lie lie Pl!I 

50 1210 1215 

iJt '^'T ACT TAT TAT TAC GCA TCT CAA ATA TTT 

Gin lie Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gin lie Phe 
1220 1225 1230 

AAA TCA AAT TTT AAT GGA GAA AAT ATT TCT GGA ATA TOT TCA ATA GGT 
Lys Ser Asn Phe Asn Gly Glu Asn He Ser Gly Ue CyJ ser lie ^y 
1^^5 1240 1245 



v"^? f^"^ AAT TAT GCT TCA TTA TTA GAA TCA ACA 



1270 1275 



1280 



3120 



3168 



3216 



Gli ^ lIu n - ^"""^ ""T* AAA GAA CTA ACT GGA ATT 

5 1025 ^ ?n?n^'*'' "® l-e" Thr Gly He 

1035 1040 

AAT TTT AGC AAA ACT ATA ACA TTT GAA ATA AAT AAA ATT CCA GAT ACC 
Asn Phe ser Lys Thr He Thr Phe Glu lie Asn Lyr. iVe Pro Asp Thr 
10 1°50 1055 

Gly IIu ill ThI Ter SI I" t"'' '^^^ ^°<= ^TA AGA GAT 

Gly Leu He Thr Ser Asp Ser Asp Asn He Asn Met Trp He Arg Asp 

1065 1070 

15 TTT TAT ATA TTT GCT AAA GAA TTA GAT GGT AAA GAT ATT AAT ATA TTA 

Phe Tyr He Phe Ala Lys Glu Leu Asp Gly Lys Asp hI He llu 

^^^^ 1080 1085 

TTT AAT AGC TTG CAA TAT ACT AAT GTT GTA AAA GAT TAT rrr rm Ann. 
20 Phe Asn ser Leu Gin ^r Thr Asn Val Val Z ^Tyl IT, oTy 

1035 1100 

GAT TTA AGA TAT AAT AAA GAA TAT TAT ATG GTT AAT ATA GAT TAT TTA 

1115 1120 

^n Aro ?Cr SI? t''^ ^"^'^ '^AA ATT GTT TTT AAT ACA CGT 

A.n Aig Tyr Met Tyr Ala Asn Ser Arg Gin He Val Phe Asn Thr Arg 
■50 1125 1130 1135 

AGA AAT AAT AAT GAC TTC AAT GAA GGA TAT AAA ATT ATA ATA AAA ACa 
Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys S Ue hJ iJS J?g 
11*0 1145 1150 ^ 

H » Av^ n^'' i'S* *AT GAT ACT AGA GTA CGA GGA GGA GAT ATT TTA 

He Aig Gly A3n Thr Asn Asp Thr Arg Val Arg Gly Glv Asp III Leu 

llbb 1160 



3360 



3408 



3456 



3504 



TAT TTT GAT ATG ACA ATT AAT AAC AAA GCA TAT AAT TTr TTT zi'-r aiv^ -.^.^ 
Tyr Phe ASP Met Thr He Asn Asn ^ Ala lyl ^n llu III Met H^t 

^^■^5 1X80 



3600 



3648 



3696 



3744 



M ^u'^ l^'^ ^^'^ "'^ GGT GAT TGG TAT AGA CAC AAT TAT TTG 

IVso""'^ ""^^ "-^^ ^^-^ leu ' " 

■^"^^^ 1255 1260 



3840 



70 



TCA ACT CAT TGG GGT TTT GTA CCT GTA AGT GAA TAA 

Ser Thr His Trp Gly Phe Val Pro Val Ser Glu ^^^^ 
1285 1290 
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(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 12 91 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:60: 

Met Pro lie Thr He Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn 
1 5 10 15 

Lys Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu 
20 25 30 

Pro Glu Lys Ala Phe Arg He Thr Gly Asn He Trp Val He Pro Aso 
35 40 45 

Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val 
50 55 60 

Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp 

70 75 80 

Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lys Leu Phe Lys Arg 
85 90 95 

He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr Arg Leu Ser Thr 
100 105 110 

Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro He Asn Thr Phe Asp 
115 120 125 

Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gin Gly Asn 
130 135 140 

Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val He He Thr Gly 
145 150 155 160 

Pro Arg Glu Asn lie He Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr 

165 170 175 

Asn Asn Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala Leu Ser lie He 
180 185 190 

Ser He Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp 
195 200 205 

Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro He 
210 215 220 

Leu He Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly 
225 230 235 240 

He Ala He Pro Asn Asp Gin Thr He Ser Ser Val Thr Ser Asn He 
245 250 255 

Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala Glu He Tyr Ala 

260 265 270 

Phe Gly Gly Pro Thr He Asp Leu He Pro Lys Ser Ala Arg Lys Tvr 
275 280 285 

Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser He Ala Lys Arg Leu 
290 295 300 

Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr He Gly 
•J05 310 315 320 
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Glu Tyr Lys Gin Lys Leu He Arg Lys Tyr Arg Phe Val Val Glu Ser 

325 330 335 

Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn 
340 345 350 

Glu Leu Thr Gin He Phe Thr Glu Phe Asn Tyr Ala Lys He Tyr Asn 
355 360 365 

Val Gin Asn Arg Lyt; He Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr 
370 375 380 

Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin Asn Gly Phe Asn 
385 390 395 400 

He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gin Asn Leu Ser 
405 410 415 

Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu 
420 425 430 

Phe Thr Lys Phe Cys His Lys Ala He Asp Gly Arg Ser Leu Tyr Asn 
435 440 ■ 445 

Lys Thr Leu Asp Cys Arg Giu Leu Leu val Lys Asn Thr Asp Leu Pro 
•150 455 460 

Phe He Gly Asp He Ser Asp Val Lys Thr Asp He Phe Leu Arg Lys 
•565 470 475 480 

Asp He Asn Glu Glu Thr Glu Val He Tyr Tyr Pro Asp Asn Val Ser 
485 490 495 

Val Asp Gin val He Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu 
500 505 510 

Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu He Leu Pro Gly 
515 520 525 

Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu 
530 535 540 

Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu 
545 550 555 560 

Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala 

565 570 575 

Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly 
580 585 590 

Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp 

595 600 605 

Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp Lys He Ser Asp 
610 615 620 

Val Ser Ala He He Pro Tyr He Gly Pro Ala Leu Asn He Ser Asn 
625 630 635 640 

Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val 
645 650 655 

Thr He Leu Leu Glii Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly 
660 665 670 

Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Aon Glu He He Lys 
675 680 685 

Thr He Asp Asn Cys Leu Glu Gin Arg He Lys Arg Trp Lys Asp Ser 
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690 695 



700 



Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg He He Thr Gin Phe 

710 715 720 

Asn Asn He Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly 
725 730 735 

Ala He Lys Ala Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser 
740 745 750 

Asp Lys Glu Asn He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu 
755 760 765 

Asp Val Lys He Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arq 

770 775 780 

Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He 
785 790 



795 800 



Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu He Asn 

810 815 

Leu He Asp Ser His Asn He He Leu Val Gly Glu Val Asp Lys Leu 
820 825 830 

Lys Ala Lys Val Aon Asn Ser Phe Gin Asn Thr He Pro Phe Asn He 
835 840 845 

Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp He He Asn Glu Tyr 
850 855 860 

Phe Asn Asn He Asn Asp Ser Lys He Leu Ser Leu Gin Asn Arg Lvs 
865 870 875 880 

Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Ser Glu Glu 
885 890 895 

Gly Asp Val Gin Leu Asn Pro He Phe Pro Phe Asp Phe Lys Leu Gly 
900 905 910 

Ser Ser Gly Glu Asp Arg Gly Lys Val He Val Thr Gin Asn Glu Asn 
915 920 925 

He Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser He Ser Phe Trp He 
930 935 940 

Arg He Asn Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr He He Asp 
945 950 955 ggg 

Ser Val Lys Asn Asn Ser Gly Trp Ser He Gly He He Ser Asn Phe 
965 970 975 

Leu Val Phe Thr Leu Lys Gin Asn Glu Asp Ser Glu Gin Ser He Asn 
980 985 990 

Phe Ser Tyr Asp He Ser Asn Asn Ala Pro Gly Tyr Asn Lvs Trp Phe 
995 1000 1005 

Phe Val Thr Val Thr Asn Asn Met Met Gly Asn Met Lys He Tyr He 
1010 1015 1020 

Asn Gly Lys Leu He Asp Thr He Lys Val Lvs Glu Leu Thr Gly He 
1025 1030 1035 1040 

Asn Phe Ser Lys Thr He Thr Phe Glu He Asn Lys He Pro Asp Thr 
1045 1050 1055 

Gly Leu He Thr Ser Asp Ser Asp Asn He Asn Met Trp He Arg Asp 
1060 1065 1070 
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Phe Tyr He Phe Ala Lys Glu Leu Asp Gly Lys Asp He Asn He Leu 
1075 1080 1085 

Phe Asn Ser Leu Gin Tyr Thr Asn Val val Lys Asp Tyr Trp Gly Asn 
1090 1095 1100 

Asp Leu Arg Tyr Asn Lys Glu Tyr Tvr Met Val Asn He Asp Tyr Leu 
1105 1110 1115 1120 

Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gin He Val Phe Asn Thr Arg 
1125 1130 1135 

Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys He He He Lys Arg 
1140 1145 1150 

He Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly Asp He Leu 
1155 1160 1165 

Tyr Phe Asp Met Thr He Asn Asn Lys Ala Tyr Asn Leu Phe Met Lys 
1170 1175 1180 

Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp He Tyr Ala 
1185 1190 1195 1200 

He Gly Leu Arg Glu Gin Thr Lys Asp He Aan Asp Asn He He Phe 
1205 1210 1215 

Gin He Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gin He Phe 
1220 1225 1230 

Lys Ser Asn Phe Asn Gly Glu Asn He Ser Gly He Cys Ser He Gly 
1235 1240 1245 

Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His Asn Tyr Leu 
1250 1255 1260 

Val Pro Thr Val Lys Gin Gly Asn Tyr Ala Ser Leu Leu Glu Ser Thr 
1265 1270 1275 1280 

Ser Thr Hio Trp Gly Phe Val Pro Val Ser Glu 
1285 1290 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

Ux) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 108.. 1493 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 

His His His His His His His His His Ser Ser Gly His He Glu Gly 
5 10 15 

CGT CAT ATG GCT AGC ATG GCT TTA TTA AAA GAT ATA ATT AAT GAA TAT 
Arg His Met Ala Ser Met Ala Leu Leu Lys Asp He He Asn Glu Tyr 
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2° 25 30 35 

Jhe Jtn f f ^^^^ ^^^^ ^^A AAC AGA AAA 

Phe Asn Asn He Asn Asp Ser Lys lie Leu Ser Leu Gin Asn Arg Lys 
40 45 50 

AAT ACT TTA GTG GAT ACA TCA GGA TAT AAT GCA GAA GTG AGT GAA GAA 
Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Ser Gul 

60 65 

GGC GAT GTT CAG CTT AAT CCA ATA TTT CCA TTT GAC TTT AAA TTA GOT 
Gly ASP Val Gin Lou Asn Pro He Phe Pro Phe Asp pJe ^ llu Ty 
" 75 80 

^tl l?" ^""^ ^^"^ GTT ATA GTA ACC CAG AAT GAA AAT 

ser ser Gly Glu Asp Arg Gly Lys Val He Val Thr Gin Asn Glu A^n 
85 90 95 

tTJ SI^ ^'^^ '^'^ '^'^^ TTT AGC ATT AGT TTT TGG ATT 

lie val Tyr Asn Ser Met Tyr Glu Ser Phe Ser He Ser Phe Trp lie 

115 

AGA ATA AAT AAA TGG GTA AGT AAT TTA CCT GGA TAT ACT ATA ATT GAT 
Arg lie Asn Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr lie lie AsJ 
i20 130 

^« vll ^ I""" ^'^ ^^"^ ^"^^ '^TT ATT AGT AAT TTT 

Ser \,al Lys Asn Asn Ser Gly Trp Ser He Gly He He Ser Asn Phe 
■'^•'5 140 145 

lIu vI^" JhI AAT GAA GAT AGT GAA CAA AGT ATA AAT 

Leu Val Phe Thr Leu Lys Gin Asn Glu Asp Ser Glu Gin Ser He Asn 

155 160 

III c^'^ l""^ ^^"^ ^T'^ "^^^ ^'^ <=<^ GGA TAC AAT AAA TGG TTT 

Phe Ser Tyr Asp He Ser Asn Asn Ala Pro Gly Tyr Asn Lys Trp Phe 
J''^ 170 175 

Phi ul^ ?^ t'E^ '^'^ AAT ATG AAG ATT TAT ATA 

Phe Vdl Thr Val Thr Asn Aon Met Met Gly Asn Met Lys He Tyr He 

185 190 195 

AAT GGA AAA TTA ATA GAT ACT ATA AAA GTT AAA GAA CTA ACT GGA ATT 
Asn Gly Lys Leu He Asp Thr He Lys Val Lys Glu Leu Thr Gly He 
200 205 210 

AAT TTT AGC AAA ACT ATA ACA TTT GAA ATA AAT AAA ATT CCA GAT ACC 
Acn Phe Ser Lys Thr He Thr Phe Glu He Asn Lys He Pro Asp Thr 
215 220 225 

II?, tT^ ^h"^ l^" '^^ f""^^ ATG TGG ATA AGA GAT 

^^"^ ^''P Asn Met Trp He Arg Asp 

-JO 235 240 

in tT* ^^"^ ^ '^'^^ GGT AAA GAT ATT AAT ATA TTA 

Phe Tyr He Phe Ala Lys Glu Leu Asp Gly Lys Asp He Asn He Leu 

•^45 250 255 

TTT AAT AGC TTG CAA TAT ACT AAT GTT GTA AAA GAT TAT TGG GGA AAT 
Phe Asn ser Leu Gin Tyr Thr Asn Val Val Lys Asp Tyr Trp Gly Asn 

265 270 275 

GAT TTA AGA TAT AAT AAA GAA TAT TAT ATG GTT AAT ATA GAT TAT TTA 
Asp Leu Arg Tyr Asn Lys Glu Tyr Tyr Met Val Asn He Asp Tyr Leu 

280 285 290 

AAT AGA TAT ATG TAT GCG AAC TCA CGA CAA ATT GTT TTT AAT ACA CGT 
Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gin He Val Phe Asn Thr Arg 
295 300 305 
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AGA AAT AAT AAT GAC TTC AAT GAA GGA TAT AAA ATT ATA ATA AAA AGA 1076 
Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys He He He Lys Arg 
310 315 320 

ATC AGA GGA AAT ACA AAT GAT ACT AGA GTA CGA GGA GGA GAT ATT TTA 1124 
He Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly Asp He Leu 
325 330 335 

TAT TTT GAT ATG ACA ATT AAT AAC AAA GCA TAT AAT TTG TTT ATG AAG 117-^ 
Tyr Phe Asp Met Thr He Asn Asn Lys Ala Tyr Asn Leu Phe Met Lys 

345 350 355 

AAT GAA ACT ATG TAT GCA GAT AAT CAT AGT ACT GAA GAT ATA TAT GCT i'>20 
Asn Giu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp He Tyr Ala 
360 365 370 

ATA GGT TTA AGA GAA CAA ACA AAG GAT ATA AAT GAT AAT ATT ATA TTT 
He Gly Leu Arg Glu Gin Thr Lys Asp He Asn Asp Asn He He Phe 
375 380 385 

CAA ATA CAA CCA ATG AAT AAT ACT TAT TAT TAC GCA TCT CAA ATA TTT 
Gin He Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gin He Phe 
390 395 400 

AAA TCA AAT TTT AAT GGA GAA AAT ATT TCT GGA ATA TGT TCA ATA GGT 1364 
Lys Ser Asn Phe Asn Gly Glu Asn He Ser Gly He Cvs Ser He Glv 
-105 410 415 ' 

ACT TAT CGT TTT AGA CTT GGA GGT GAT TGG TAT AGA CAC AAT TAT TTG 1412 
Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His Asn Tyr Leu 
-120 425 430 435 

GTG CCT ACT GTG AAG CAA GGA AAT TAT GCT TCA TTA TTA GAA TCA ACA 1460 
^ Val Pro Thr Val Lys Gin Gly Asn Tvr Ala Ser Lou Leu Glu S»*r Thr 

-^'^ 440 445 

TCA ACT CAT TGG GGT TTT GTA CCT GTA AGT GAA TAAAAGCTT 1502 
ser Thr His Trp Gly Phe Val Pro Val Ser Glu 
455 460 



10 



15 



20 



30 



40 



50 



60 



(2) INFORMATION FOR SEQ ID NO: 62: 



ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 4 62 amino acids 
•^•'^ (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:62: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
^5 10 15 

He Glu Gly Arg His Met Ala Ser Met Ala Leu Leu Lys Asp He He 
-0 25 30 

Asn Glu Tyr Phe Asn Asn He Asn Asp Ser Lys He Leu Ser Leu Gin 
35 40 45 

Asn Arg Lys Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val 
50 55 60 

Ser Glu Glu Gly Asp Val Gin Leu Asn Pro He Phe Pro Phe Asp Phe 
65 70 75 80 

Lys Leu Gly Ser Ser Gly Glu Asp Arg Gly Lys Val He Val Thr Gin 
85 90 95 

70 Asn Glu Asn He Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser He Ser 

. 341 . 



1268 



1316 
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100 105 110 

Phe Trp He Arg He Asn Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr 
115 120 125 

He He Asp Ser Val Lys Asn Asn Ser Gly Trp Ser He Gly He He 
130 135 140 

Ser Asn Phe Leu Val Phe Thr Leu Lys Gin Asn Glu Asp Ser Glu Gin 

150 155 160 

Ser He Asn Phe Ser Tyr Asp He Ser Asn Asn Ala Pro Gly Tyr Asn 
165 170 175 

Lys Trp Phe Phe Val Thr Val Thr Asn Asn Met Met Glv Asn Met Lyc 
180 185 ' 190 

He Tyr He Asn Gly Lys Leu He Asp Thr He Lys Val Lys Glu Leu 
195 200 205 

Thr Gly He Asn Phe Ser Lyc Thr He Thr Phe Glu He Asn Lys He 
210 215 220 

Pro Asp Thr Gly Leu He Thr Ser Asp Ser Asp Asn He Asn Met Trp 

230 235 240 

He Arg Asp Phe Tyr He Phe Ala Lys Glu Leu Asp Gly Lys Asp He 
245 250 255 

Asn He Leu Phe Asn Ser Leu Gin Tyr Thr Asn Val Val Lys Asp Tvr 
260 265 270 

Trp Gly Asn Asp Leu Arg Tyr Asn Lys Glu Tyr Tyr Met Val Asn He 
275 280 285 

Asp Tyr Leu Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gin He Val Phe 
290 295 300 

Asn Thr Arg Arg Aon Asn Asn Asp Phe Asn Glu Gly Tyr Lys He He 
305 310 315 320 

He Lyo Arg He Arg Gly Asn Thr Asn Asp Thr Arg Val Arq Gly Glv 
325 330 335 

Asp He Leu Tyr Phe Asp Met Thr Ho Asn Asn Lys Ala Tyr Asn Leu 
340 345 350 

Phe Met Lys Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp 
355 360 365 

He Tyr Ala He Gly Leu Arg Glu Gin Thr Lys Asp He Asn Asp Asn 
370 375 380 

He He Phe Gin He Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser 

^85 390 395 400 

Gin He Phe Lys Ser Asn Phe Asn Gly Glu Asn He Ser Gly He Cys 
405 410 415 

Ser He Gly Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His 
420 425 430 

Asn Tyr Leu Val Pro Thr Val Lys Gin Gly Asn Tyr Ala Ser Leu Leu 
435 440 445 

Glu Ser Thr Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 63: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

CGCCATGGCT TTATTAAAAG ATATAATTAA TG 

{2) INFORMATION FOR SEQ ID NO: 64: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc «= "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

GCAAGCTTTT ATTCACTTAC AGGTACAAAA CC 



(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE; 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 3828 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

ATG ACA TGG CCA GTA AAA GAT TTT AAT TAT AGT GAT CCT GTT AAT GAC 
Met Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Vai Asn Asp 

- 5 10 15 

AAT GAT ATA TTA TAT TTA AGA ATA CCA CAA AAT AAG TTA ATT ACT ACA 
Asn Asp lie Leu Tyr Leu Arg He Pro Gin Asn Lys Leu He Thr Thr 

20 25 30 

CCT GTA AAA GCT TTT ATG ATT ACT CAA AAT ATT TGG GTA ATA CCA GAA 
Pro Val Lys Ala Phe Met He Thr Gin Asn He Trp Val He Pro Glu 
35 40 45 

AGA TTT TCA TCA GAT ACT AAT CCA AGT TTA AGT AAA CCG CCC AGA CCT 
Arg Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lyc Pro Pro Arg Pro 

50 55 60 

ACT TCA AAG TAT CAA AGT TAT TAT GAT CCT AGT TAT TTA TCT ACT GAT 
Thr Ser Lys Tyr Gin Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp 
65 70 75 80 

GAA CAA AAA GAT ACA TTT TTA AAA GGG ATT ATA AAA TTA TTT AAA AGA 
Glu Gin Lyc Asp Thr Phe Leu Lys Gly He He Lys Leu Phe Lys Arg 
85 90 95 



ATT AAT GAA AGA GAT ATA GGA AAA AAA TTA ATA AAT TAT TTA GTA GTT 
lie Asn Glu Arg Asp He Gly Lys Lys Leu He Asn Tyr Leu Val Val 
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100 105 



110 



15 



.10 



35 



40 



50 



AAA TAT AAA AAA ATA TTT TCT GAA AAG TAT AAT TTT GAT AAA GAT AAT 
Lys Tyr Lys Lys He Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn 
325 330 



384 



43: 



480 



528 



GlJ Ifr PrI HI M^^ r?"" """^ '^^^ ^^"^ ACA TTT GAT 

. Gly Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp 
liS 120 125 

TTT ACA CGT CAT ACT ACT AAT ATT GCA GTT GAA AAG TTT GAA AAT GGT 
Phe Thr Arg His Thr Thr Asn He Ala Val Glu Lys Phe Glu Asn Gil 
10 135 140 

AGT TGG AAA GTA ACA AAT ATT ATA ACA CCA AGT GTA TTG ATA TTT GGA 
Ser Trp Lys Val Thr Asn He He Thr Pro Ser Val Leu lie cty 

150 155 j^gj 

CCA CTT OCT AAT ATA TTA GAC TAT ACA GCA TCC CTT ACA TTG CAA GGA 
Pro Leu Pro Asn lie Leu Asp Tyr Thr Ala Ser Leu Thr Leu Glv 
165 170 

-»() "^^^ '^^'^ TTT GGA ACA TTA TCT ATA CTA <;•»« 

-0 Gin Gin ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser lie Lei 
18° 185 190 

^ vlt D*^"^ '^■''^ ^^'^ °AT GTA ACA TCT AAT 

^. Lys Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser Asn 
155 200 205 

CAA AGT TCA GCT GTA TTA GGC AAA TCT ATA TTT TGT ATG GAT CCA GTA 
" ^er ser Ala Val Leu Gly Lys Ser lie Phe Cys Met Asp Pro Val 
-1" 215 220 

ATA GCT TTA ATG CAT GAG TTA ACA CAT TCT TTG CAT CAA TTA TAT GGA 7-0 
lie Ala Leu Met His Glu Leu Thr His Ser Leu His Gin Lou Tyr Gly 

230 235 240 

ATA AAT ATA CCA TCT GAT AAA AGG ATT CGT CCA CAA GTT AGC GAG GGA 768 
He Asn lie Pro Ser Asp Lys Arg He Arg Pro Gin Val ser Glu ciy 
24S 250 255 

III 11^ l^"^ ^^"^ ^'f'^ °TA CAA TTT GAG GAA TTA TAT ACA 816 

Phe Phe ser Gin Asp Gly Pro Asn Val Gin Phe Glu Glu Leu Tyr Thr 
260 265 270 

D^"^ S?^ '^'^ ^^"^ ^"^"^ *TA ATA CCT CAA ATT GAA AGA TCA CAA 864 

^. Phe Gly Gly Leu Asp Val Glu He He Pro Gin He Glu Arg Ser Gin 



624 



672 



280 285 

TTA AGA GAA AAA GCA TTA GGT CAC TAT AAA GAT ATA GCG AAA AGA CTT 912 

Leu Arg Glu Lys Ala Leu Gly His Tyr Lys Asp He Ala Lys Arg Leu 

290 295 300 

AAT AAT ATT AAT AAA ACT ATT CCT TCT AGT TGG ATT AGT AAT ATA GAT 960 

Asn Asn He Asn Lys Thr He Pro Ser Ser Trp He Ser Asn He Asp 

3X0 315 320 



1008 



ACA GGA AAT TTT GTT GTA AAT ATT GAT AAA TTC AAT AGC TTA TAT TCA i nc;«: 

Thr Gly Asn Phe Val Val Asn He Asp Lys Phe A^n Ser Uu T^i sly 
340 345 350 
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10 



15 



20 



30 



40 



45 



55 



60 



65 



70 



GAC TTG ACT AAT GTT ATG TCA GAA GTT GTT TAT TCT TCG CAA TAT AAT 1104 
Asp Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gin Tyr Asn 
355 360 365 

GTT AAA AAC AGG ACT CAT TAT TTT TCA AGG CAT TAT CTA CCT GTA TTT 1152 
Val Lys Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe 
370 375 380 

GCA AAT ATA TTA GAT GAT AAT ATT TAT ACT ATA AG A GAT GGT TTT AAT 1200 
Ala Asn lie Leu Asp Asp Asn lie Tyr Thr He Arg Asp Gly Phe Asn 
385 390 395 400 

TTA ACA AAT AAA GGT TTT AAT ATA GAA AAT TCG GGT CAG AAT ATA GAA 1248 
Leu Thr Asn Lyc Gly Phe Asn He Glu Asn Ser Gly Gin Asn He Glu 
405 410 415 

AGG AAT CCT GCA CTA CAA AAG CTT AGT TCA GAA AGT GTA GTA GAT TTA 1296 
Arg Asn Pro Ala Leu Gin Lys Leu Ser Ser Glu Ser Val Val Asp Leu 
420 425 430 

TTT ACA AAA GTA TGT TTA AGA TTA ACA AAA AAT AGT AGA GAT GAT TCA 1344 
Phe Thr Lys Val Cys Leu Arg Leu Thr Lys Asn Ser Arg Asp Asp Ser 
435 440 445 

ACA TGT ATT AAA GTT AAA AAT AAT AGA TTA CCT TAT GTA GCT GAT AAA 13 92 

Thr Cys He Lys Val Lys Asn Asn Arg Leu Pro Tyr Val Ala Asp Lys 
450 455 460 

GAT AGC ATT TCA CAA GAA ATA TTT GAA AAT AAA ATT ATT ACA GAT GAG 144 0 

Asp Ser He Ser Gin Glu He Phe Glu Asn Lys He He Thr Asp Glu 
465 470 475 48O 

ACT AAT GTA CAA AAT TAT TCA GAT AAT TTT TCA TTA GAT GAA TCT ATT 1488 
Thr Acn Val Gin Asn Tyr Ser Asp Asn Phe Ser Leu Asp Glu Ser He 
485 490 495 

TTA GAT GGG CAA GTT CCT ATT AAT CCT GAA ATA GTA GAT CCA CTA TTA 1536 
Leu Asp Gly Gin Val Pro He Asn Pro Glu He Val Asp Pro Leu Leu 

500 505 510 

CCC AAT GTT AAT ATG GAA CCT TTA AAT CTT CCA GGT GAA GAA ATA GTA 1584 
Pro Asn Val Asn Met Glu Pro Leu Asn Leu Pro Gly Glu Glu He Val 

515 520 525 

TTT TAT GAT GAT ATT ACT AAA TAT GTT GAT TAT TTA AAT TCT TAT TAT 1632 
Phe Tyr Aop Asp He Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tvr Tvr 
530 535 540 



TAT TTG GAA TCT CAA AAA TTA AGT AAT AAT GTT GAA AAT ATT ACT CTT 1680 
Tyr Leu Glu Ser Gin Lys Leu Ser Asn Asn Val Glu Asn He Thr Leu 
545 550 555 560 



ACA ACT TCA GTT GAA GAA GCA TTA GGT TAT AGC AAT AAG ATA TAC ACA 1728 
Thr Thr Ser Val Glu Glu Ala Leu Gly Tyr Ser Asn Lys He Tyr Thr 
565 570 575 

TTT TTA CCT AGC TTA GCT GAA AAA GTG AAT AAA GGT GTT CAA GCA GGT 17 76 

Phe Leu Pro Ser Leu Ala Glu Lys Val Asn Lys Gly Val Gin Ala Gly 
580 585 590 

TTA TTC TTA AAT TGG GCG AAT GAA GTA GTT GAG GAT TTT ACT ACA AAT 1824 
Leu Phe Leu Asn Trp Ala Asn Glu Val Val Glu Asp Phe Thr Thr Asn 
595 600 605 

ATT ATG AAG AAA GAT ACA TTG GAT AAA ATA TCA GAT GTA TCA GTA ATA 1872 
He Met Lys Lys Asp Thr Leu Asp Lys He Ser Asp Val Ser Val He 
610 615 620 

ATT CCA TAT ATA GGA CCT GCC TTA AAT ATA GGA AAT TCA GCA TTA AGG 1920 
He Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Ser Ala Leu Arg 
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630 

GGA AAT TTT AAG CAA GCA TTT GCA ACA GCT GGT GTA GCT TTT TTA TTA 

Gly Asn Phe Lys Gin Ala Phe Ala Thr Ala Gly val Ala Phe Leu Leu 

650 655 

GAG GGA TTT CCA GAG TTT ACT ATA CCT GCA CTC GGT GTA TTT ACC TTT 
Glu Gly Phe Pro Glu Phe Thr lie Pro Ala Leu Gly Val Phe Thr Phe 
660 665 670 

Itl ^T. I^"^ ^ ^'^T AAA ACT ATA GAA AAT 

Tyr Ser Ser He Gin Glu Arg Glu Lys He He Lys Thr He dlu Asn 
675 680 685 

Cvl U?. '^^^ ^ TCA TAT CAA TGG ATG 

Cys Leu Glu Gin Arg Val Lys Arg Trp Lys Asp Ser Tyr Gin Trp Met 

695 700 

GTA TCA AAT TGG TTG TCA AGA ATT ACT ACT CAA TTT AAT CAT ATA AAT 
Val ser Asn Trp Leu Ser Arg He Thr Thr Gin Phe Asn His He Asn 

''lO 715 720 

Itl MI^ '^'"'^ '^^'^ '^^'^ GCA ATC AAA GCT 

Tyr Gin Met Tyr Asp Ser Leu Ser Tyr Gin Ala Asp Ala He Lys Ala 
725 730 735 

AAA ATA GAT TTA GAA TAT AAA AAA TAC TCA GGA AGT GAT AAA GAA AAT 
Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys Glu Asn 
■^40 745 750 

ATA AAA AGT CAA GTT GAA AAT TTA AAA AAT AGT TTA GAT GTA AAA ATT 
He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys He 
"^^5 760 765 

TCG GAA GCA ATG AAT AAT ATA AAT AAA TTT ATA CGA GAA TGT TCT GTA 
ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg Glu Cys Ser Val 

770 775 780 

ACA TAC TTA TTT AAA AAT ATG CTC CCT AAA GTA ATT GAC GAA TTA AAT 
Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He Asp Glu Leu Asn 

790 795 BOO 

AAG TTT GAT TTA AGA ACT AAA ACA GAA TTA ATT AAT CTT ATA GAT AGT 2448 
Lys Phe Asp Leu Arg Thr Lys Thr Glu Leu He Asn Leu He Asp Ser 
805 810 815 

CAT AAT ATT ATT CTA GTT GGT GAA GTA GAT AGA TTA AAA GCA AAA GTA 
His Asn He He Leu Val Gly Glu Val Asp Arg Leu Lvs Ala Lys Val 
820 825 * 830 

AAT GAG AGT TTT GAA AAT ACA ATG CCT TTT AAT ATT TTT TCA TAT ACT 
Asn Glu Ser Phe Glu Asn Thr Met Pro Phe Asn He Phe Ser Tyr Thr 
835 840 845 

AAT AAT TCT TTA TTA AAA GAT ATA ATT AAT GAA TAT TTC AAT AGT ATT 2592 
Asn Asn Ser Leu Leu Lys Asp He He Asn Glu Tyr Phe Asn Ser He 
850 855 860 

AAT GAT TCA AAA ATT TTG AGC TTA CAA AAC AAA AAA AAT GCT TTA GTG 264 0 

Asn Asp Ser Lys He Leu Ser Leu Gin Asn Lys Lys Asn Ala Leu Val 
865 870 875 880 

GAT ACA TCA GGA TAT AAT GCA GAA GTG AGG GTA GGA GAT AAT GTT CAA 
Asp Thr Ser Gly Tyr Asn Ala Glu Val Arg Val Gly Asp Asn Val Gin 
885 890 895 

CTT AAT ACG ATA TAT ACA AAT GAC TTT AAA TTA AGT AGT TCA GGA GAT 
Leu Asn Thr He Tyr Thr Asn Asp Phe Lys Leu Ser Ser Ser Gly asd 
900 905 910 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2496 



2544 



2688 



2736 
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2928 



2976 



3024 



AAA ATT ATA GTA AAT TTA AAT AAT AAT ATT TTA TAT AGC GCT ATT TAT 2784 
Lys lie He Val Asn Leu Asn Asn Asn He Leu Tyr Ser Ala He Tyr 
915 920 925 

GAG AAC TCT AGT GTT AGT TTT TGG ATT AAG ATA TCT AAA GAT TTA ACT 2832 
Glu Asn Ser Ser Val Ser Phe Trp He Lys He Ser Lys Asp Leu Thr 
930 935 940 

AAT TCT CAT AAT GAA TAT ACA ATA ATT AAC AGT ATA GAA CAA AAT TCT 2880 
Asn Ser His Asn Glu Tyr Thr He He Asn Ser He Glu Gin Asn Ser 

950 955 960 

GGG TGG AAA TTA TGT ATT AGG AAT GGC AAT ATA GAA TGG ATT TTA CAA 
Gly Trp Lys Leu Cys He Arg Acn Gly Asn He Glu Trp He Leu Gin 
965 970 975 

GAT GTT AAT AGA AAG TAT AAA AGT TTA ATT TTT GAT TAT AGT GAA TCA 
Asp Val Asn Arg Lys Tyr Lys Ser Leu He Phe Asp Tyr Ser Glu Ser 
980 985 990 

TTA AGT CAT ACA GGA TAT ACA AAT AAA TGG TTT TTT GTT ACT ATA ACT 
Leu Ser His Thr Gly Tyr Thr Asn Lys Trp Phe Phe Val Thr He Thr 
995 1000 1005 

AAT AAT ATA ATG GGG TAT ATG AAA CTT TAT ATA AAT GGA GAA TTA AAG 3072 
Asn Asn He Met Gly Tyr Met Lys Leu Tyr He Asn Gly Glu Leu Lys 
-Olf^ 1015 1020 

CAG AGT CAA AAA ATT GAA GAT TTA GAT GAG GTT AAG TTA GAT AAA ACC 3120 
Gin Ser Gin Lys He Glu Asp Leu Asp Glu Val Lys Leu Asp Lys Thr 
1025 1030 1035 1040 

ATA GTA TTT GGA ATA GAT GAG AAT ATA GAT GAG AAT CAG ATG CTT TGG 3X6 8 

He Val Phe. Gly He Asp Glu Asn He Asp Glu Asn Gin Met Leu Trp 
1045 1050 1055 

ATT AGA GAT TTT AAT ATT TTT TCT AAA GAA TTA AGT AAT GAA GAT ATT 3216 
He Arg Asp Phe Asn He Phe Ser Lys Glu Leu Ser Asn Glu Asp He 
1060 1065 1070 

AAT ATT GTA TAT GAG GGA CAA ATA TTA AGA AAT GTT ATT AAA GAT TAT 3264 
Asn He Val Tyr Glu Gly Gin He Leu Arg Asn Val He Lys Asp Tyr 
1075 1080 1085 

TGG GGA AAT CCT TTG AAG TTT GAT ACA GAA TAT TAT ATT ATT AAT GAT 3312 
Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tyr lie He Asn Asp 
1090 1095 1100 

AAT TAT ATA GAT AGG TAT ATT GCA CCT GAA AGT AAT GTA CTT GTA CTT 3 360 

Asn Tyr He Asp Arg Tyr He Ala Pro Glu Sor Asn Val Leu Val Leu 
1105 1110 1115 1120 

GTT CGG TAT CCA GAT AGA TCT AAA TTA TAT ACT GGA AAT CCT ATT ACT 34 08 

Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly Asn Pro He Thr 
1125 1130 1135 

ATT AAA TCA GTA TCT GAT AAG AAT CCT TAT AGT AGA ATT TTA AAT GGA 34 56 

He Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg He Leu Asn Gly 
1140 1145 1150 

GAT AAT ATA ATT CTT CAT ATG TTA TAT AAT AGT AGG AAA TAT ATG ATA 3 504 

Asp Asn He He Leu His Met Leu Tyr Asn Ser Arg Lys Tyr Met He 
1155 1160 1165 

ATA AGA GAT ACT GAT ACA ATA TAT GCA ACA CAA GGA GGA GAG TGT TCA 
He Arg Asp Thr Asp Thr He Tyr Ala Thr Gin Gly Gly Glu Cys Ser 
1170 1175 1180 

CAA AAT TGT GTA TAT GCA TTA AAA TTA CAG AGT AAT TTA GGT AAT TAT 
Gin Asn Cys Val Tyr Ala Leu Lys Leu Gin Ser Asn Leu Gly Asn Tyr 
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10 



20 



60 



1185 1190 1195 1200 

GGT ATA GGT ATA TTT AGT ATA AAA AAT ATT GTA TCT AAA AAT AAA TAT 3648 
Gly He Gly He Phe Ser He Lys Asn He Val Ser Lya Asn Lys Tyr 
1205 1210 1215 

TGT AGT CAA ATT TTC TCT AGT TTT AGG GAA AAT ACA ATG CTT CTA GCA 3696 
Cys Ser Gin He Phe Ser Ser Phe Arg Glu Asn Thr Met Leu Leu Ala 
1220 1225 1230 

GAT ATA TAT AAA CCT TGG AGA TTT TCT TTT AAA AAT GCA TAC ACG CCA 3744 
Asp He Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn Ala Tyr Thr Pro 
1235 1240 1245 



15 GTT GCA GTA ACT AAT TAT GAA ACA AAA CTA TTA TCA ACT TCA TCT TTT 

Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser Thr Ser Ser Phe 
1250 1255 1260 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 1276 amino acids 
<D) TYPE: amino acid 
(D) TOPOLOGY: linear 

-^0 (ii) MOLECULE TYPE: protein 

(xil SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Met Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn Asp 
-'^ 1 5 10 15 

Asn Asp He Leu Tyr Leu Arg He Pro Gin Asn Lys Leu He Thr Thr 
20 25 30 

40 Pro Val Lys Ala Phe Met He Thr Gin Asn He Trp Val He Pro Glu 
35 40 45 

Arg Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro 
50 55 60 

Thr Ser Lys Tyr Gin Ser Tyr Tyr Asp Pro Ser Tvr Leu Ser Thr Asp 
65 70 75 ' 80 

Glu Gin Lys Asp Thr Phe Leu Lys Gly He He Lyc Leu Phe Lys Arg 
.'^O 85 90 95 

He Asn Glu Arg Asp He Gly Lys Lys Leu He Asn Tyr Leu Val Val 

100 105 110 

5:> Gly Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp 
115 120 125 



Phe Thr Arg His Thr Thr Asn He Ala Val Glu Lys Phe Glu Asn Gly 

130 135 140 

Ser Trp Lys Val Thr Asn He He Thr Pro Ser Val Leu He Phe Gly 
145 150 155 160 



Pro Leu Pro Asn He Leu Asp Tyr Thr Ala Ser Leu Thr Leu Gin Gly 

6:) 165 170 175 

Gin Gin Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser He Leu 

180 185 190 

70 Lys Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser Asn 



3792 



TGG AAA TTT ATT TCT AGG GAT CCA GGA TGG GTA GAG TAA 3331 
Trp Lys Phe He Ser Arg Asp Pro Gly Trp Val Glu 
1265 1270 1275 

iZ) INFORMATION FOR SEQ ID NO: 66: 
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155 200 205 

Gin Ser Ser Ala Val Leu Gly Lys Ser He Phe Cvs Met Asp Pro Val 
210 215 220 

He Ala Leu Met His Glu Leu Thr His Ser Leu His Gin Leu Tyr Gly 
225 230 235 240 

He Asn He Pro Ser Asp Lys Arg He Arg Pro Gin Val Ser Glu Gly 
245 250 255 

Phe Phe Ser Gin Asp Gly Pro Asn Val Gin Phe Glu Glu Leu Tyr Thr 
260 265 270 

Phe Gly Gly Leu Asp Val Glu He He Pro Gin He Glu Arg Ser Gin 
275 280 285 

Leu Arg Glu Lys Ala Leu Gly His Tyr Lys Asp He Ala Lys Arq Leu 
^90 295 300 

Asn Asn He Asn Lys Thr He Pro Ser Ser Tro He Ser Asn He Asp 

310 315 320 

Lys Tyr Lys Lys He Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn 
325 330 335 

Thr Gly Asn Phe Val Val Acn He Asp Lys Phe Asn Ser Leu Tvr Ser 
340 345 350 

Asp Lou Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gin Tyr Asn 
355 360 365 

Val Lys Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe 
370 375 380 

Ala Aon He Leu Asp Asp Asn He Tyr Thr He Arg Asp Gly Phe Asn 
385 390 395 400 

Leu Thr Asn Lys Gly Phe Asn He Glu Asn Ser Gly Gin Asn He Glu 
405 410 415 

Arg Asn Pro Ala Leu Gin Lys Leu Ser Ser Glu Ser Val Val Asp Leu 
420 425 430 

Phe Thr Lys Val Cys Leu Arg Leu Thr Lys Asn Ser Arg Asp Asp Ser 
435 440 445 

Thr Cys He Lys Val Lys Asn Asn Arg Leu Pro Tvr Val Ala Asp Lys 
450 455 460 

Asp Ser He Ser Gin Glu He Phe Glu Asn Lys lie He Thr Asp Glu 
465 470 475 480 

Thr Asn Val Gin Asn Tyr Ser Asp Asn Phe Ser Leu Asp Glu Ser He 
485 490 495 

Leu Asp Gly Gin Val Pro He Asn Pro Glu He Val Asp Pro Leu Leu 
500 505 510 

Pro Asn Val Asn Met Glu Pro Leu Asn Leu Pro Gly Glu Glu He Val 
5T5 520 525 

Phe Tyr Asp Asp He Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tvr Tvr 
530 535 540 ' 

Tyr Leu Glu Ser Gin Lys Leu Ser Asn Asn Val Glu Asn He Thr Leu 
545 550 555 560 

Thr Thr Ser Val Glu Glu Ala Leu Gly Tyr Ser Asn Lys He Tyr Thr 
565 570 575 
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Phe Leu Pro Ser Leu Ala Glu Lys Val Asn Lys Gly Val Gin Ala Glv 

580 585 590 

Leu Phe Leu Asn Trp Ala Asn Glu Val Val Glu Asp Phe Thr Thr Asn 

600 605 

He Met Lys Lys Asp Thr Leu Asp Lys He Ser Asp Val Ser Val He 

bio 



620 



lie Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Ser Ala Leu Arq 

630 635 640 

Gly Asn Phe Lys Gin Ala Phe Ala Thr Ala Gly Val Ala Phe Leu Leu 

650 655 

Glu Gly Phe Pro Glu Phe Thr He Pro Ala Leu Gly Val Phe Thr Phe 

665 670 

Tyr Ser Ser He Gin Glu Arg Glu Lys He He Lys Thr He Glu Asn 
6^5 680 685 

Cys Leu Glu Gin Arg Val Lys Arg Trp Lys Asp Ser Tyr Gin Trp Met 

Val Ser Asn Trp Leu Ser Arg He Thr Thr Gin Phe Asn His He Asn 

"^iO 715 720 

Tyr Gin Met Tyr Asp Ser Leu Ser Tyr Gin Ala Asp Ala He Lys Ala 
725 730 735 

Lys He Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lvs Glu Asn 

745 750 

He Lys Ser Gin Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys He 
755 760 765 

Ser Glu Ala Met Asn Asn He Asn Lys Phe He Arg Glu Cys Ser Val 
'"0 775 780 

Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val He Asp Glu Leu Asn 

790 795 800 

Lys Phe Asp Leu Arg Thr Lys Thr Glu Leu He Asn Leu He Asp Ser 
805 810 815 

His Acn He He Leu Val Gly Glu Val Asp Arg Leu Lys Ala Lys Val 
820 825 830 

Asn Glu ser Phe Glu Asn Thr Met Pro Phe Asn He Phe Ser Tyr Thr 
835 840 845 

Asn Asn Ser Leu Leu Lys Asp He He Asn Glu Tyr Phe Asn Ser He 
850 855 860 

Asn Asp Ser Lys He Leu Ser Leu Gin Asn Lys Lys Asn Ala Leu Val 
®65 870 875 880 

Asp Thr Ser Gly Tyr Asn Ala Glu Val Arg Val Gly Asp Asn Val Gin 
885 890 895 

Leu Asn Thr He Tyr Thr Asn Asp Phe Lys Leu Ser Ser Ser Gly Asd 
900 905 910 

Lys He He Val Asn Leu Asn Asn Asn He Leu Tvr Ser Ala He Tyr 
915 920 ' 925 

Glu Asn Ser Ser Val Ser Phe Trp He Lys He Ser Lys Asp Leu Thr 
930 935 940 

Asn Ser His Asn Glu Tyr Thr He He Asn Ser He Glu Gin Asn Ser 

- 350 - 



wo 98/08540 



PCT/US97/I5394 



945 950 955 960 

Gly Trp Lys Leu Cys He Arg Asn Gly Asn He Glu Trp He Leu Gin 
. 965 970 975 

Asp Val Asn Arg Lys Tyr Lys Ser Leu He Phe Asp Tyr Ser Glu Ser 
980 985 990 

Leu Ser His Thr Gly Tyr Thr Asn Lys Trp Phe Phe Val Thr He Thr 
995 1000 1005 



15 



20 



35 



40 



45 



50 



Asn Asn He Met Gly Tyr Met Lys Leu Tyr He Asn Gly Glu Leu Lys 
1010 1015 1020 

Gin Ser Gin Lys He Glu Asp Leu Asp Glu Val Lys Leu Asp Lys Thr 
1025 1030 1035 1040 

He Val Phe Gly He Asp Glu Asn He Asp Glu Asn Gin Met Leu Trp 
1045 1050 1055 

He Arg Asp Phe Asn He Phe Ser Lys Glu Leu Ser Asn Glu Asp He 
1060 1065 1070 

Asn He Val Tyr Glu Gly Gin He Leu Arg Asn Val He Lys Asp Tvr 
1075 1080 1085 

Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tyr He He Asn Asp 
1090 1095 1100 

Asn Tyr He Asp Arg Tyr He Ala Pro Glu Ser Asn Val Leu Val Leu 
1105 1110 1115 1120 

Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly Asn Pro He Thr 
1125 1130 1135 

He Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg He Leu Asn Gly 
1140 1145 1150 

Asp Asn He He Leu His Met Leu Tyr Asn Ser Arg Lys Tyr Met Ho 
1155 1160 1165 

He Arg Asp Thr Asp Thr He Tyr Ala Thr Gin Gly Gly Glu Cys Ser 
1170 1175 1180 

Gin Asn Cvs Val Tyr Ala Leu Lys Leu Gin Ser Asn Leu Gly Asn Tvr 
1185 1190 1195 1200 

Gly He Gly He Phe Ser He Lys Asn He Val Ser Lys Asn Lys Tyr 
1205 1210 1215 

Cys Ser Gin He Phe Ser Ser Phe Arg Glu Asn Thr Met Leu Leu Ala 
1220 1225 1230 



Asp He Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn Ala Tyr Thr Pro 
-^-^ 1235 1240 1245 



6(1 



Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser Thr Ser Ser Phe 
1250 1255 1260 

Trp Lys Phe He Ser Arg Asp Pro Gly Trp Val Glu 

1265 1270 1275 
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(2) INFORMATION FOR SEQ ID NO: 67; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

fix} FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 108.. 1460 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 67: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 
His His His His His His His His His Ser Ser Gly His lie fly 
5 10 IS 

CGT CAT ATG GCT AGC ATG GCT TTA TTA AAA GAT ATA ATT AAT GAA TAT 
Arg His Met Ala Ser Met Ala Leu Leu Lys Asp He He Asn Glu Tyr 

25 30 35 

Dhf c^"^ ^^'^ ^ ^'^ AGC TTA CAA AAC AAA AAA : 

Phe Asn Ser He Asn Asp Ser Lys He Leu Ser Leu Gin Asn Lys Lys 
40 45 50 

AAT GCT TTA GTG GAT ACA TCA GGA TAT AAT GCA GAA GTG AGG GTA GGA 3 
Asn Ala Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Arg val Glv 
55 60 65 

GAT AAT GTT CAA CTT AAT ACG ATA TAT ACA AAT GAC TTT AAA TTA AGT 3 
Asp Asn Val Gin Leu Asn Thr He Tyr Thr Asn Asp Phe Lys Leu Ser 
'^^ 75 80 

AGT TCA GGA GAT AAA ATT ATA GTA AAT TTA AAT AAT AAT ATT TTA TAT 4 
Ser Ser Gly Asp Lys He He Val Asn Leu Asn Asn Asn He Leu Tyr 
85 90 95 ' 

AGC GCT ATT TAT GAG AAC TCT AGT GTT AGT TTT TGG ATT AAG ATA TCT 4 
Ser Ala He Tyr Glu Asn ser Ser Val Ser Phe Trp He Lys He Ser 

105 110 115 

AAA GAT TTA ACT AAT TCT CAT AAT GAA TAT ACA ATA ATT AAC AGT ATA 5 
Lys Asp Leu Thr Asn Ser His Asn Glu Tyr Thr He He Asn Ser He 
120 125 130 

GAA CAA AAT TCT GGG TGG AAA TTA TGT ATT AGG AAT GGC AAT ATA GAA 5- 
Glu Gin Asn Ser Gly Trp Lys Leu Cys He Arg Asn Gly Asn He Glu 
135 140 145 

TGG ATT TTA CAA GAT GTT AAT AGA AAG TAT AAA AGT TTA ATT TTT GAT 5 
Trp He Leu Gin Asp Val Asn Arg Lys Tyr Lys Ser Leu He Phe Asp 
150 155 160 

TAT AGT GAA TCA TTA AGT CAT ACA GGA TAT ACA AAT AAA TGG TTT TTT 6 
Tyr Ser Glu Ser Leu Ser His Thr Gly Tyr Thr Asn Lys Trp Phe Phe 
165 170 175 

GTT ACT ATA ACT AAT AAT ATA ATG GGG TAT ATG AAA CTT TAT ATA AAT 6' 
Val Thr He Thr Asn Asn He Met Gly Tyr Met Lys Leu Tyr He Asn 
180 185 190 195 
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20 



40 



50 



GGA GAA TTA AAG CAG AGT CAA AAA ATT GAA GAT TTA GAT GAG GTT AAG 
.:»ly Glu Leu Lys Gin Ser Gin Lys He Glu Asp Leu Asp Glu Val Lys 
200 205 210 

TTA GAT AAA ACC ATA GTA TTT GGA ATA GAT GAG AAT ATA GAT GAG AAT 
Leu Asp Lys Thr lie Val Phe Gly He Asp Glu Asn He Asp Glu Asn 
215 220 225 



AAT GAA GAT ATT AAT ATT GTA TAT GAG GGA CAA ATA TTA AGA AAT GTT 
Asn Glu Asp He Asn He Val Tyr Glu Gly Gin He Leu Arg Asn Val 

250 255 

ATT AAA GAT TAT TGG GGA AAT CCT TTG AAG TTT GAT ACA GAA TAT TAT 
He i.ys Asp Tyr Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tvr 

265 270 275 

ATT ATT AAT GAT AAT TAT ATA GAT AGG TAT ATT GCA CCT GAA AGT AAT 
He He Asn Asp Asn Tyr He Asp Arg Tyr He Ala Pro Glu Ser Asn 
280 285 290 

GTA CTT GTA CTT GTT CGG TAT CCA GAT AGA TCT AAA TTA TAT ACT GGA 
.al Leu Val Leu Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly 
295 300 305 



AAA TAT ATG ATA ATA AGA GAT ACT GAT ACA ATA TAT GCA ACA CAA GGA 

Lys Tyr Met He He Arg Asp Thr Asp Thr He Tyr Ala Thr Gin Glv 

^^0 345 350 355 

GGA GAG TGT TCA CAA AAT TGT GTA TAT GCA TTA AAA TTA CAG AGT AAT 

Gly Glu Cys Ser Gin Asn Cys Val Tyr Ala Leu Lys Leu Gin Ser Asn 
360 365 370 

4.^ TTA GGT AAT TAT GGT ATA GGT ATA TTT AGT ATA AAA AAT ATT GTA TCT 

Leu Gly Asn Tyr Gly He Gly He Phe Ser He Lys Asn He Val Ser 

375 380 385 



ATG CTT CTA GCA GAT ATA TAT AAA CCT TGG AGA TTT TCT TTT AAA AAT 
Met Leu Leu Ala Asp He Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn 
405 410 

GCA TAC ACG CCA GTT GCA GTA ACT AAT TAT GAA ACA AAA CTA TTA TCA 
Ala Tyr Thr Pro Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser 
(^Q 425 430 435 

ACT TCA TCT TTT TGG AAA TTT ATT TCT AGG GAT CCA GGA TGG GTA GAG 
Thr Ser Ser Phe Trp Lys Phe He Ser Arg Asp Pro Gly Trp Val Glu 
440 445 450 

65 TAAAAGCTT 

(2) INFORMATION FOR SEQ ID NO: 68: 



70 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 amino acids 



740 



788 



H) M*^? P° ^T*^ ^^"^ '^'^ '^TT TTT TCT AAA GAA TTA AGT 836 

W cm Met Leu Trp He Arg Asp Phe Asn He Phe Ser Lys Glu Leu Se- 
230 235 240 



864 



932 



980 



1028 



•if) » tT^ ^ "^^^ ^'^^ ^AT AAG AAT CCT TAT AGT AGA 1076 

M) Asn Pro He Thr He Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arq 
310 315 320 

ATT TTA AAT GGA GAT AAT ATA ATT CTT CAT ATG TTA TAT AAT AGT AGG 1124 

^ lie Leu Asn Gly Asp Asn He He Leu His Met Leu Tyr Asn Ser Arq 

•'•^ 325 330 335 ^ 



1172 



1220 



1268 



AAA AAT AAA TAT TGT AGT CAA ATT TTC TCT AGT TTT AGG GAA AAT ACA 1316 
Lys Asn Lys Tyr Cys Ser Gin He Phe Ser Ser Phe Arg Glu Asn Thr 

390 395 400 



1364 



1412 



1460 



1469 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;68: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
^5 10 



15 



lie Glu Gly Arg His Met Ala Ser Met Ala Leu Leu Lys Asp He He 
20 25 30 

Aan Glu Tyr Phe Asn Ser He Asn Asp Ser Lys He Leu Ser Leu Gin 
35 40 45 

Asn Lys Lys Asn Ala Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val 
5° 55 60 

Arg Val Gly Asp Asn Val Gin Leu Asn Thr He Tyr Thr Asn Asp Phe 

''O 75 80 

Lys Leu Ser Ser Ser Gly Asp Lys He He Val Asn Leu Asn Asn Asn 
85 90 95 

He Leu Tyr Ser Ala He Tyr Glu Asn Ser Ser Val Ser Phe Trp He 

105 no 

Lyc He Ser Lys Asp Leu Thr Asn Ser His Asn Glu Tyr Thr He He 
115 120 125 

Asn Ser He Glu Gin Asn Ser Gly Trp Lys Leu Cys He Arg Asn Glv 

135 140 

Asn He Glu Trp He Leu Gin Asp Val Asn Arg Lys Tyr Lys Ser Leu 

150 155 j^^Q 

He Phe Asp Tyr Ser Glu Ser Leu Ser His Thr Gly Tyr Thr Asn Lys 
165 170 175 

Trp Phe Phe Val Thr He Thr Asn Asn He Met Glv Tyr Met Lys Leu 
180 185 ' 190 

Tyr He Asn Gly Glu Leu Lys Gin Ser Gin Lys He Glu Asp Leu Asd 
i95 200 205 ^ 

Glu Val Lys Leu Asp Lys Thr He Val Phe Gly He Asp Glu Asn Ho 
210 215 220 

Asp Glu Asn Gin Met Leu Trp He Arg Asp Phe Asn He Phe Ser Lys 
225 230 235 240 

Glu Leu Ser Asn Glu Asp He Asn He Val Tyr Glu Gly Gin He Leu 
245 250 255 

Arg Asn Val He Lys Asp Tyr Trp Gly Asn Pro Leu Lys Phe Asp Thr 
260 265 270 

Glu Tyr Tyr He He Asn Asp Asn Tyr He Asp Arg Tyr He Ala Pro 
275 280 285 

Glu Ser Asn Val Leu Val Leu Val Arg Tyr Pro Asp Arg Ser Lys Leu 
-90 295 300 

Tyr Thr Gly Asn Pro He Thr He Lys Ser Val Ser Asp Lys Asn Pro 
305 310 315 320 

Tyr Ser Arg He Leu Asn Gly Asp Asn He He Leu His Met Leu Tyr 
325 330 335 
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Asn Ser Arg Lys 
340 

Thr Gin Gly Gly 
355 

Gin Ser Asn Leu 
370 

He Val Ser Lvs 
385 

Glu Asn Thr Met 



Phe Lys Asn Ala 
420 

Leu Leu Ser Thr 
435 

Trp Val Glu 
450 



Tyr Met He He 



Glu Cys Ser Gin 
360 

Gly Asn Tyr Gly 

375 

Asn Lys Tyr Cys 
390 

Leu Leu Ala Asp 
405 

Tyr Thr Pro Val 



Ser Ser Phe Trp 
440 



Arg Asp Thr Asp 
345 

Asn Cys Val Tyr 



He Gly He Phe 
380 

Ser Gin He Phe 
395 

He Tyr Lys Pro 
410 

Ala Val Thr Asn 
425 



Lys Phe He Ser 



Thr He Tyr Ala 

350 

Ala Leu Lys Leu 
365 

Ser He Lys Asn 



Ser Ser Phe Arg 
400 

Trp Arg Phe Ser 
415 

Tyr Glu Thr Lys 
430 



Arg Asp Pro Gly 
445 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GCAAGCTTTT ACTCTACCCA TCCTGGATCC CT 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 82 5 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3822 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ATG CCA GTT GCA ATA AAT ACT TTT AAT TAT AAT GAC CCT GTT AAT GAT 
Met Pro Val Ala He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp 
^5 10 15 

GAT ACA ATT TTA TAC ATG CAG ATA CCA TAT GAA GAA AAA AGT AAA AAA 
Asp Thr He Leu Tyr Met Gin He Pro Tyr Glu Glu Lvs Ser Lys Lvs 
20 25 "30 

TAT TAT AAA GCT TTT GAG ATT ATG CGT AAT GTT TGG ATA ATT CCT GAG 
Tyr Tyr Lys Ala Phe Glu He Met Arg Asn Val Trp He He Pro Glu 
35 40 45 

AGA AAT ACA ATA GGA ACG AAT CCT AGT GAT TTT GAT CCA CCG GCT TCA 
Arg Asn Thr He Gly Thr Asn Pro Ser Asp Phe Asp Pro Pro Ala Ser 
50 55 60 
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240 



336 



384 



432 



480 



528 



576 



TTA AAG AAC GGA AGC AGT GCT TAT TAT GAT CCT AAT TAT TTA ACC ACT 
Leu Lys Asn Gly Ser Ser Ala Tyr Tyr Asp Pro Asn Tyr Leu Thr Thr 
^5 70 75 Qo 

GAT GCT GAA AAA GAT AGA TAT TTA AAA ACA ACG ATA AAA TTA TTT AAG 288 
Asp Ala Glu Lys Asp Arg Tyr Leu Lys Thr Thr He Lys Leu Phe Lys 
85 90 95 

AGA ATT AAT AGT AAT CCT GCA GGG AAA GTT TTG TTA CAA GAA ATA TCA 
Arg He Asn Ser Asn Pro Ala Gly Lys Val Leu Leu Gin Glu He Ser 
100 105 110 

TAT GCT AAA CCA TAT TTA GGA AAT GAC CAC ACG CCA ATT GAT GAA TTC 
Tyr Ala Lys Pro Tyr Leu Gly Asn Asp His Thr Pro He Asp Glu Phe 

120 125 

TCT CCA GTT ACT AGA ACT ACA AGT GTT AAT ATA AAA TTA TCA ACT AAT 
Ser Pro Val Thr Arg Thr Thr Ser Val Asn He Lys Leu Ser Thr Asn 
130 135 j^4Q 

GTT GAA AGT TCA ATG TTA TTG AAT CTT CTT GTA TTG GGA GCA GGA CCT 
Val Glu Ser Ser Met Leu Leu Asn Leu Leu Val Leu Gly Ala Glv Pro 

150 155 - 

GAT ATA TTT GAA AGT TGT TGT TAC CCC GTT AGA AAA CTA ATA GAT CCA 
Asp He Phe Glu Ser Cys Cys Tyr Pro Val Arg Lys Leu He Asp Pro 
165 170 175 

GAT GTA GTT TAT GAT CCA AGT AAT TAT GGT TTT GGA TCA ATT AAT ATC 
Asp Val Val Tyr Asp Pro Ser Asn Tyr Gly Phe Gly Ser He Asn He 
180 185 190 

GTG ACA TTT TCA CCT GAG TAT GAA TAT ACT TTT AAT GAT ATT AGT GGA 624 
Val Thr Phe Ser Pro Glu Tyr Glu Tyr Thr Phe Asn Asp He Ser Glv 
195 200 205 

GGG CAT AAT AGT AGT ACA GAA TCA TTT ATT GCA GAT CCT GCA ATT TCA 672 
Gly HiG Asn Ser Ser Thr Glu Ser Phe He Ala Asp Pro Ala He Ser 
210 215 220 

CTA GCT CAT GAA TTG ATA CAT GCA CTG CAT GGA TTA TAC GGG GCT AGG 720 
Leu Ala His Glu Leu He His Ala Leu His Gly Leu Tyr Gly Ala Arg 
225 230 235 240 

GGA GTT ACT TAT GAA GAG ACT ATA GAA GTA AAG CAA GCA CCT CTT ATG 768 
Gly Val Thr Tyr Glu Glu Thr He Glu Val Lys Gin Ala Pro Leu Met 
245 250 255 

ATA GCC GAA AAA CCC ATA AGG CTA GAA GAA TTT TTA ACC TTT GGA GGT 816 
He Ala Glu Lys Pro He Arg Leu Glu Glu Phe Leu Thr Phe Glv Glv 
260 265 270 

CAG GAT TTA AAT ATT ATT ACT AGT GCT ATG AAG GAA AAA ATA TAT AAC 864 
Gin Asp Leu Asn He He Thr Ser Ala Met Lys Glu Lys He Tyr Asn 
275 280 285 

AAT CTT TTA GCT AAC TAT GAA AAA ATA GCT ACT AGA CTT AGT GAA GTT 912 
Asn Leu Leu Ala Asn Tyr Glu Lys He Ala Thr Arg Leu Ser Glu Val 

290 295 300 

AAT AGT GCT CCT CCT GAA TAT GAT ATT AAT GAA TAT AAA GAT TAT TTT 960 
Asn Ser Ala Pro Pro Glu Tyr Asp He Asn Glu Tyr Lys Asp Tyr Phe 
305 310 315 320 

CAA TGG AAG TAT GGG CTA GAT AAA AAT GCT GAT GGA AGT TAT ACT GTA 1008 
Gin Trp Lys Tyr Gly Leu Asp Lys Asn Ala Asp Gly Ser Tyr Thr Val 
325 330 335 

AAT GAA AAT AAA TTT AAT GAA ATT TAT AAA AAA TTA TAT AGT TTT ACA 1056 
Asn Glu Asn Lys Phe Asn Glu He Tyr Lys Lys Leu Tyr Ser Phe Thr 
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340 345 350 

GAG AGT GAC TTA GCA AAT AAA TTT AAA GTA AAA TGT AGA AAT ACT TAT 
Glu Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Cys Arg Asn Thr Tyr 
355 360 365 

TTT ATT AAA TAT GAA TTT TTA AAA GTT CCA AAT TTG TTA GAT GAT GAT 
Phe He Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp 
370 375 380 

ATT TAT ACT GTA TCA GAG GGG TTT AAT ATA GGT AAT TTA GCA GTA AAC 
He Tyr Thr Val Ser Glu Gly Phe Asn He Gly Asn Leu Ala V^l Asn 
385 390 395 400 

AAT CGC GGA CAA AGT ATA AAG TTA AAT CCT AAA ATT ATT GAT TCC ATT 
Asn Arg Gly Gin Ser He Lys Leu Asn Pro Lys He He Asp Ser He 
405 410 415 

CCA GAT AAA GGT CTA GTA GAA AAG ATC GTT AAA TTT TGT AAG AGC GTT 
Pro Asp Lys Gly Leu Val Glu Lys He Val Lys Phe Cys Lys Ser Val 
420 425 430 

ATT CCT AGA AAA GGT ACA AAG GCG CCA CCG CGA CTA TGC ATT AGA GTA 
He Pro Arg Lys Gly Thr Lys Ala Pro Pro Arg Leu Cys He Arg Val 
435 440 445 

AAT AAT AGT GAG TTA TTT TTT GTA GCT TCA GAA AGT AGC TAT AAT GAA 
Asn Asn Ser Glu Leu Phe Phe Val Ala Ser Glu Ser Ser Tyr Asn Glu 
'J50 455 460 

AAT GAT ATT AAT ACA CCT AAA GAA ATT GAC GAT ACA ACA AAT CTA AAT 
Asn Asp He Asn Thr Pro Lys Glu He Asp Asp Thr Thr Asn Leu Asn 

470 475 480 

AAT AAT TAT AGA AAT AAT TTA GAT GAA GTT ATT TTA GAT TAT AAT AGT 
Asn Asn Tyr Arg Asn Asn Leu Asp Glu Val He Leu Asp Tyr Asn Ser 
485 490 495 

CAG ACA ATA CCT CAA ATA TCA AAT CGA ACA TTA AAT ACA CTT GTA CAA 
Gin Thr He Pro Gin He Ser Asn Arg Thr Leu Asn Thr Leu Val Gin 
500 505 510 

GAC AAT AGT TAT GTG CCA AGA TAT GAT TCT AAT GGA ACA AGT GAA ATA 
Asp Asn Ser Tyr Val Pro Arg Tyr Asp Ser Asn Gly Thr Ser Glu He 
515 520 525 

GAG GAA TAT GAT GTT GTT GAC TTT AAT GTA TTT TTC TAT TTA CAT GCA 
Glu Glu Tyr Asp Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala 
530 535 540 

CAA AAA GTG CCA GAA GGT GAA ACC AAT ATA AGT TTA ACT TCT TCA ATT 
Gin Lys Val Pro Glu Gly Glu Thr Asn He Ser Leu Thr Ser Ser He 
545 550 555 560 

GAT ACA GCA TTA TTA GAA GAA TCC AAA GAT ATA TTT TTT TCT TCA GAG 
Asp Thr Ala Leu Leu Glu Glu Ser Lys Asp He Phe Phe Ser Ser Glu 
565 570 575 

TTT ATC GAT ACT ATC AAT AAA CCT GTA AAT GCA GCA CTA TTT ATA GAT 
Phe He Asp Thr He Asn Lys Pro Val Asn Ala Ala Leu Phe He Asp 
580 585 590 

TGG ATA AGC AAA GTA ATA AGA GAT TTT ACC ACT GAA GCT ACA CAA AAA 
Trp He Ser Lys Val He Arg Asp Phe Thr Thr Glu Ala Thr Gin Lys 
595 600 605 

AGT ACT GTT GAT AAG ATT GCA GAC ATA TCT TTA ATT GTA CCC TAT GTA 
Ser Thr Val Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr Val 
610 615 620 
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GGT CTT GCT TTG AAT ATA ATT ATT GAG GCA GAA AAA GGA AAT TTT GAG 
Gly Leu Ala Leu Asn He He He Glu Ala Glu Lys Gly Asn Phe Glu 

€30 640 

GAG GCA TTT GAA TTA TTA GGA GTG GGT ATT TTA TTA GAA TTT GTG CCA 
Glu Ala Phe Glu Leu Leu Gly Val Gly He Leu Leu Glu Phe Val Pro 
€45 650 655 

GAA CTT ACA ATT CCT GTA ATT TTA GTG TTT ACG ATA AAA TCC TAT ATA 
Glu Leu Thr He Pro Val He Leu Val Phe Thr He Lys Ser Tvr He 

665 670 

GAT TCA TAT GAG AAT AAA AAT AAA GCA ATT AAA GCA ATA AAT AAT TCA 
Asp ser Tyr Glu Asn Lys Asn Lys Ala He Lys Ala He Asn Asn Ser 
€75 680 685 

TTA ATC GAA AGA GAA GCA AAG TGG AAA GAA ATA TAT AGT TGG ATA GTA 
Leu He Glu Arg Glu Ala Lys Trp Lys Glu He Tyr Ser Trp He Val 
690 695 700 

TCA AAT TGG CTT ACT AGA ATT AAT ACT CAA TTT AAT AAA AGA AAA GAG 
Ser Asn Trp Leu Thr Arg He Asn Thr Gin Phe Asn Lys Arg Lys Glu 

710 715 ^ y y 

CAA ATG TAT CAG GCT TTA CAA AAT CAA GTA GAT GCA ATA AAA ACA GCA 
Gin Met Tyr Gin Ala Leu Gin Asn Gin Val Asp Ala He Lys Thr Ala 
725 730 735 

ATA GAA TAT AAA TAT AAT AAT TAT ACT TCA GAT GAG AAA AAT AGA CTT 
He Glu Tyr Lys Tyr Asn Asn Tyr Thr Ser Asp Glu Lys Asn Arg Leu 
740 745 750 

GAA TCT GAA TAT AAT ATC AAT AAT ATA GAA GAA GAA TTG AAT AAA AAA 
Glu Ser Glu Tyr Asn He Asn Asn He Glu Glu Glu Leu Asn Lvs Lvs 
755 760 765 

GTT TCT TTA GCA ATG AAA AAT ATA GAA AGA TTT ATG ACA GAA AGT TCT 
Val Ser Leu Ala Met Lys Asn He Glu Arg Phe Met Thr Glu Ser Ser 
770 775 780 

ATA TCT TAT TTA ATG AAA TTA ATA AAT GAA GCC AAA GTT GGT AAA TTA 2400 
He Ser Tyr Leu Met Lys Leu He Asn Glu Ala Lys Val Gly Lys Leu 
"795 790 795 800 

AAA AAA TAT GAT AAC CAT GTT AAG AGC GAT TTA TTA AAC TAT ATT CTC 244 8 

Lys Lys Tyr Asp Asn His Val Lys Ser Asp Leu Leu Asn Tyr He Leu 
805 810 815 

GAC CAT AGA TCA ATC TTA GGA GAG CAG ACA AAT GAA TTA AGT GAT TTG 2496 
Asp His Arg Ser He Leu Gly Glu Gin Thr Asn Glu Leu Ser Asp Leu 
820 825 830 

GTG ACT AGT ACT TTG AAT AGT AGT ATT CCA TTT GAA CTT TCT TCA TAT 2544 
Val Thr Ser Thr Leu Asn Ser Ser He Pro Phe Glu Leu Ser Ser Tyr 
835 840 845 

ACT AAT GAT AAA ATT CTA ATT ATA TAT TTT AAT AGA TTA TAT AAA AAA 2 592 

Thr Asn Asp Lys He Leu He He Tyr Phe Asn Arg Leu Tyr Lys Lys 
850 855 860 

ATT AAA GAT AGT TCT ATT TTA GAT ATG CGA TAT GAA AAT AAT AAA TTT 264 0 

He Lys Asp Ser Ser He Leu Asp Met Arg Tyr Glu Asn Asn Lvs Phe 
865 870 875 * 880 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 
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ATA GAT ATC TCT GGA TAT GGT TCA AAT ATA AGC ATT AAT GGA AAC GTA 2688 
He Asp He Ser Gly Tyr Gly Ser Asn He Ser He Asn Gly Asn Val 
885 890 895 

TAT ATT TAT TCA ACA AAT AGA AAT CAA TTT GGA ATA TAT AAT AGT AGG 2736 
Tyr He Tyr Ser Thr Asn Arg Asn Gin Phe Gly He Tyr Asn Ser Arq 
900 905 910 

CTT AGT GAA GTT AAT ATA GCT CAA AAT AAT GAT ATT ATA TAG AAT AGT 2784 
Leu Ser Glu Val Asn He Ala Gin Asn Asn Asp He He Tvr Asn Ser 
915 920 925 

AGA TAT CAA AAT TTT AGT ATT AGT TTC TGG GTA AGG ATT OCT AAA CAC 2832 
Arg Tyr Gin Asn Phe Ser He Ser Phe Trp Val Arg He Pro Lys His 
930 935 940 

TAC AAA CCT ATG AAT CAT AAT CGG GAA TAC ACT ATA ATA AAT TGT ATG 2880 
Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr He He Asn Cys Met 
945 950 955 960 

GGG AAT AAT AAT TCG GGA TGG AAA ATA TCA CTT AGA ACT GTT AGA GAT -^928 
Gly Asn Asn Asn Ser Gly Trp Lys He Ser Leu Arg Thr Val Arg Asp 
965 970 975 

TGT GAA ATA ATT TGG ACT TTA CAA GAT ACT TCT GGA AAT AAG GAA AAT ^976 
Cys CUu He He Trp Thr Leu Gin Asp Thr Ser Gly Asn Lys Glu Asn 
980 985 990 

TTA ATT TTT AGG TAT GAA GAA CTT AAT AGG ATA TCT AAT TAT ATA AAT 3024 
Leu He Phe Arg Tyr Glu Glu Leu Asn Arg He Ser Asn Tyr He Asn 
995 1000 1005 

AAA TGG ATT TTT GTA ACT ATT ACT AAT AAT AGA TTA GGC AAT TCT AGA 3 072 

Lys Trp He Phe Val Thr He Thr Asn Asn Arg Leu Gly Asn Ser Arg 
1010 1015 1020 

ATT TAC ATC AAT GGA AAT TTA ATA GTT GAA AAA TCA ATT TCG AAT TTA 3120 
He Tyr He Asn Gly Asn Leu He Val Glu Lys Ser He Ser Asn Leu 
1035 1030 1035 1040 

GGT GAT ATT CAT GTT AGT GAT AAT ATA TTA TTT AAA ATT GTT GGT TGT 3168 
Gly Asp He His Val Ser Asp Asn He Leu Phe Lys He Val Glv Cys 
1045 1050 1055 

GAT GAT GAA ACG TAT GTT GGT ATA AGA TAT TTT AAA GTT TTT AAT ACG 3216 
Asp Asp Glu Thr Tyr Val Gly He Arg Tyr Phe Lys Val Phe Asn Thr 
1060 1065 1070 

GAA TTA GAT AAA ACA GAA ATT GAG ACT TTA TAT AGT AAT GAG CCA GAT 3264 
Glu Leu Asp Lys Thr Glu He Glu Thr Leu Tyr Ser Asn Glu Pro Asp 
1075 1080 1085 

CCA AGT ATC TTA AAA AAC TAT TGG GGA AAT TAT TTG CTA TAT AAT AAA 3 312 

Pro Ser He Leu Lys Asn Tyr Trp Gly Asn Tyr Leu Leu Tyr Asn Lys 
3.090 1095 1100 

AAA TAT TAT TTA TTC AAT TTA CTA AGA AAA GAT AAG TAT ATT ACT CTG 3360 
Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr He Thr Leu 

1110 1115 1120 

AAT TCA GGC ATT TTA AAT ATT AAT CAA CAA AGA GGT GTT ACT GAA GGC 34 08 

Asn Ser Gly He Leu Asn He Asn Gin Gin Arg Gly Val Thr Glu Gly 
1125 1130 1135 

TCT GTT TTT TTG AAC TAT AAA TTA TAT GAA GGA GTA GAA GTC ATT ATA 34 56 

Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu Val He He 
1140 1145 1150 

AGA AAA AAT GGT CCT ATA GAT ATA TCT AAT ACA GAT AAT TTT GTT AGA 3504 
Arg Lys Asn Gly Pro He Asp He Ser Asn Thr Asp Asn Phe Val Arg 
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15 



20 



45 



50 



60 



65 



1155 1160 1165 

AAA AAC GAT CTA GCA TAG ATT AAT GTA GTA GAT CGT GGT GTA GAA TAT 3552 
Lys Asn Asp Leu Ala Tyr lie Asn Val Val Asp Arg Gly Val Glu Tvr 
1170 1175 1180 



CGG TTA TAT GOT GAT ACA AAA TCA GAG AAA GAG AAA ATA ATA AGA ACA 36 00 

Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys He He Arg Thr 
10 1195 1200 



TCT AAT CTA AAC GAT AGC TTA GGT CAA ATT ATA GTT ATG GAT TCA ATA 364 8 

Ser Asn Leu Asn Asp Ser Leu Gly Gin He He Val Met Asp aer He 
1205 1210 1215 

GGA AAT AAT TGC ACA ATG AAT TTT CAA AAC AAT AAT GGG AGC AAT ATA 36qfi 
Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn Giy Ser Asn He 
1220 1225 1230 

GGA TTA CTA GGT TTT CAT TCA AAT AAT TTG GTT GCT AGT AGT TGG TAT 374 4 

Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala Ser Ser Trp Tvr 
1235 1240 1245 

TAT AAC AAT ATA CGA AGA AAT ACT AGC AGT AAT GGA TGC TTT TGG AGT 37 97 

^ Tyr Asn Asn He Arg Arg Asn Thr Ser Ser Asn Gly Cys Phe Trp Ser 

1250 1255 1260 

TCT ATT TCT AAA GAG AAT GGA TGG AAA GAA TGA 30,5 
Ser He Ser Lys Glu Asn Gly Trp Lys Glu 
1265 1270 

M) 

(2) INFORMATION FOR SEQ ID N0:71: 

(i) SEQUENCE CHARACTERISTICS: 
^. (A) LENGTH: 1274 amino acids 

-^^ (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 

Met Pro Val Ala He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp 
1 5 10 



15 



Asp Thr He Leu Tyr Met Gin He Pro Tvr Glu Glu Lys Ser Lys Lys 
20 25 * 3C 

Tyr Tyr Lys Ala Phe Glu He Met Arg Asn Val Trp He He Pro Glu 
35 40 45 

Arg Asn Thr He Gly Thr Asn Pro Ser Asp Phe Asp Pro Pro Ala Ser 
50 55 60 

Leu Lys Asn Gly Ser Ser Ala Tyr Tyr Asp Pro Asn Tyr Leu Thr Thr 
65 70 75 80 

Asp Ala Glu Lys Asp Arg Tyr Leu Lys Thr Thr He Lyc Leu Phe Lys 
85 90 95 

Arg He Asn Ser Asn Pro Ala Gly Lyc Val Leu Leu Gin Glu He Ser 

100 105 110 

Tyr Ala Lys Pro Tyr Leu Gly Asn Asp His Thr Pro He Asp Glu Phe 
115 120 125 

Ser Pro Val Thr Arg Thr Thr Ser Val Asn He Lys Leu Ser Thr Asn 
i30 135 140 

Val Glu Ser Ser Met Leu Leu Asn Leu Leu Val Leu Gly Ala Gly Pro 
7n 145 150 155 160 
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Asp He Phe Glu Ser Cys Cys Tyr Pro Val Arg Lys Leu He Asp Pro 
1€5 170 175 

Asp Val Val Tyr Asp Pro Ser Asn Tyr Gly Phe Gly Ser He Asn He 
180 185 190 

Val Thr Phe Ser Pro Glu Tyr Glu Tyr Thr Phe Asn Asp He Ser Gly 
195 200 205 

Gly His Asn Ser Ser Thr Glu Ser Phe He Ala Asp Pro Ala He Ser 
210 215 220 

Leu Ala His Glu Leu lie His Ala Leu His Gly Leu Tyr Gly Ala Arg 
225 230 235 240 

Gly Val Thr Tyr Glu Glu Thr He Glu Val Lys Gin Ala Pro Leu Met 
245 250 255 

He Ala Glu Lys Pro He Arg Leu Glu Glu Phe Leu Thr Phe Gly Glv 
260 265 270 

Gin Asp Leu Asn He He Thr Ser Ala Met Lys Glu Lys He Tyr Asn 
275 280 285 

Asn Leu Leu Ala Asn Tyr Glu Lys He Ala Thr Arg Leu Ser Glu Val 
290 295 300 

Asn Ser Ala Pro Pro Glu Tyr Asp He Asn Glu Tyr Lys Asp Tyr Phe 
305 310 315 

Gin Trp Lys Tyr Gly Leu Asp Lys Asn Ala Asp Glv Ser Tyr Thr Val 

325 330 ' 335 

Asn Glu Asn Lys Phe Asn Glu He Tyr Lys Lys Leu Tyr Ser Phe Thr 
340 345 350 

Glu Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Cys Arg Asn Thr Tyr 
355 360 365 

Phe He Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp 
370 375 380 

He Tyr Thr Val Ser Glu Gly Phe Asn He Gly Asn Leu Ala Val Asn 
385 390 395 400 

Asn Arg Gly Gin Ser He Lys Leu Asn Pro Lys He He Asp Ser He 
405 410 415 

Pro Asp Lys Gly Leu Val Glu Lys He Val Lys Phe Cys Lys Ser Val 
420 425 430 

He Pro Arg Lys Gly Thr Lys Ala Pro Pro Arg Leu Cys He Arg Val 

435 440 445 

Asn Asn Ser Glu Leu Phe Phe Val Ala Ser Glu Ser Ser Tyr Asn Glu 
450 455 460 

Asn Asp He Asn Thr Pro Lys Glu He Asp Asp Thr Thr Asn Leu Asn 
465 470 475 480 

Asn Asn T^r Arg Asn Asn Leu Asp Glu Val He Leu Asp Tyr Asn Ser 
485 490 495 

Gin Thr He Pro Gin He Ser Asn Arg Thr Leu Asn Thr Leu Val Gin 
500 505 510 

Asp Asn Ser Tyr Val Pro Arg Tyr Asp Ser Asn Gly Thr Ser Glu He 
515 520 525 

Glu Glu Tyr Asp Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala 
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530 



535 



540 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



Gin Lys Val Pro Glu Gly Glu Thr Asn He Ser Leu Thr Ser Ser He 

550 

Asp Thr Ala Leu Leu Glu Glu Ser Lys Asp He Phe Phe Ser Ser Glu 

565 570 575 

Phe He Asp Thr He Asn Lys Pro Val Asn Ala Ala Leu Phe He Asp 
580 585 590 

Trp He Ser Lys Val He Arg Asp Phe Thr Thr Glu Ala Thr Gin Lys 
595 600 605 

Ser Thr Val Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr Val 

615 620 

Gly Leu Ala Leu Asn He He He Glu Ala Glu Lys Gly Asn Phe Glu 

630 635 640 

Glu Ala Phe Glu Leu Leu Gly Val Gly He Leu Leu Glu Phe Val Pro 
645 650 655 

Glu Leu Thr He Pro Val He Leu Val Phe Thr He Lys Ser Tyr He 
660 665 670 

Asp Ser Tyr Glu Asn Lys Asn Lys Ala He Lys Ala He Asn Asn Ser 

680 685 

Leu He Glu Arg Glu Ala Lys Trp Lys Glu He Tyr Ser Trp He Val 
690 695 700 

Ser Asn Trp Leu Thr Arg He Asn Thr Gin Phe Asn Lys Arg Lys Glu 

710 715 ' ^ y 

Gin Met Tyr Gin Ala Leu Gin Asn Gin Val Asp Ala He Lys Thr Ala 
725 730 735 

He Glu Tyr Lys Tyr Asn Asn Tyr Thr Ser Asp Glu Lys Asn Arg Leu 
740 745 750 

Glu Ser Glu Tyr Asn He Asn Asn He Glu Glu Glu Leu Asn Lys Lys 
755 760 765 

Val Ser Leu Ala Met Lys Asn He Glu Arg Phe Met Thr Glu Ser Ser 

770 775 780 

He Ser Tyr Leu Met Lys Leu He Asn Glu Ala Lys Val Gly Lys Leu 
785 790 795 800 

Lys Lys Tyr Asp Asn His Val Lys Ser Asp Leu Leu Asn Tyr He Leu 
805 810 815 

Asp His Arg Ser He Leu Gly Glu Gin Thr Asn GXu Leu Ser Asp Leu 
820 825 830 

Val Thr Ser Thr Leu Asn Ser Ser He Pro Phe Glu Leu Ser Ser Tyr 
835 840 845 

Thr Asn Asp Lys He Leu He He Tyr Phe Asn Arg Leu Tyr Lys Lys 
850 855 860 

He Lys Asp Ser Ser He Leu Asp Met Arg Tyr Glu Asn Asn Lys Phe 
865 870 875 880 

He Asp He Ser Gly Tyr Gly Ser Asn He Ser He Asn Gly Asn Val 
885 890 895 

Tyr He Tyr Ser Thr Asn Arg Asn Gin Phe Gly He Tyr Asn Ser Arg 
900 905 910 



- 362 - 



wo 98/08540 



PCT/US97/15394 



15 



30 



40 



45 



Leu Ser Glu Val Asn He Ala Gin Asn Asn Asp He He Tyr Asn Ser 
915 920 925 

Arg Tyr Gin Asn Phe Ser He Ser Phe Trp Val Arg He Pro Lys His 
930 935 940 

Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr He He Asn Cys Met 
945 950 955 960 

Gly Asn Asn Asn Ser Gly Trp Lys He Ser Leu Arg Thr Val Arg Asp 
965 970 975 

Cys Glu He He Trp Thr Leu Gin Asp Thr Ser Gly Asn Lys Glu Asn 
980 985 990 

Leu He Phe Arg Tyr Glu Glu Leu Asn Arg He Ser Asn Tyr He Asn 
995 1000 1005 

Lys Trp He Phe Val Thr He Thr Asn Asn Arg Leu Gly Asn Ser Arg 
iOlO 1015 1020 

He Tyr He Asn Gly Asn Leu He Val Glu Lys Ser He Ser Asn Leu 

1030 1035 1040 

CUy Asp He His Val Ser Asp Asn He Leu Phe Lys He Val Gly Cya 
1045 1050 1055 

Asp Asp Glu Thr Tyr Val Gly He Arg Tvr Phe Lys Val Phe Asn Thr 
1060 1065 ' 1070 

Glu Leu Asp Lya Thr Glu He Glu Thr Leu Tyr Ser Asn Glu Pro Asp 
1075 1080 1085 



Pro Ser He Leu Lys Asn Tyr Trp Gly Asn Tyr Leu Leu Tyr Asn Lyr» 
1090 1095 1100 



Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr He Thr Leu 
1105 illO 1115 1120 

Asn Scr Gly He Leu Asn He Asn Gin Gin Arg Gly Val Thr Glu Gly 
1125 1130 1135 

Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu Val He He 
1140 1145 1150 

Arg Lys Asn Gly Pro He Asp He Ser Asn Thr Asp Asn Phe Val Arg 

1155 1160 1165 



Lys Asn Asp Leu Ala Tyr He Asn Val Val Asp Arg Gly Val Glu Tvr 
^0 1170 1175 1180 

Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys He He Arg Thr 
1185 1190 1195 1200 

."^^ Ser Asn Leu Asn Asp Ser Leu Gly Gin He lie Val Met Asp Ser He 

1205 1210 1215 

Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn Gly Ser Asn He 

... 1220 1225 1230 

o() 

Gly Leu L6u Gly Phe His Ser Asn Asn Leu Val Ala Ser Ser Trp Tvr 
1235 1240 1245 

Tyr Asn Asn He Arg Arg Asn Thr Ser Ser Asn Gly Cys Phe Trp Ser 
^> 1250 1255 1260 

Ser He Ser Lys Glu Asn Gly Trp Lys Glu 
1265 1270 

70 (2) INFORMATION FOR SEQ ID NO: 72: 
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ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 108.. 1451 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 
His His His His His His His His His Ser Ser Gly His He Glu Glv 
5 10 15 

CGT CAT ATG GCT AGC ATG GCT ATT CTA ATT ATA TAT TTT AAT AGA TTA 
Arg His Met Ala Ser Met Ala Ho Leu He He Tvr Phe Asn Arg Leu 

25 30 ' 35 

TAT AAA AAA ATT AAA GAT AGT TCT ATT TTA GAT ATG CGA TAT GAA AAT 
Tyr Lys Lys He Lys Asp Ser Ser He Leu Asp Met Arg Tyr Glu Asn 
40 45 50 

AAT AAA TTT ATA GAT ATC TCT GGA TAT GGT TCA AAT ATA AGC ATT AAT 
Asn Lys Phe He Asp He Ser Gly Tyr Gly Ser Asn He Scr He Asn 
55 60 65 

GGA AAC GTA TAT ATT TAT TCA ACA AAT AGA AAT CAA TTT GGA ATA TAT 3 
cay Asn Val Tyr He Tyr Ser Thr Asn Arg Asn Gin Phe Gly He Tvr 
"70 75 80 

AAT AGT AGG CTT AGT GAA GTT AAT ATA GCT CAA AAT AAT GAT ATT ATA 4 
Asn ser Arg Leu Ser Glu Val Asn He Ala Gin Asn Asn Asp He He 
85 90 95 

TAC AAT AGT AGA TAT CAA AAT TTT AGT ATT AGT TTC TGG GTA AGG ATT 4 
Tyr Asn Ser Arg Tyr Gin Asn Phe Ser He Ser Plie Trp Val Arq He 
100 105 ^ 115 

CCT AAA CAC TAC AAA CCT ATG AAT CAT AAT CGG GAA TAC ACT ATA ATA 5( 
Pro Lys His Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr He He 
120 125 130 

AAT TGT ATG GGG AAT AAT AAT TCG GGA TGG AAA ATA TCA CTT AGA ACT 
Asn Cys Met Gly Asn Asn Asn Ser Gly Trp Lys He Ser Leu Arg Thr 
135 140 145 

GTT AGA GAT TGT GAA ATA ATT TGG ACT TTA CAA GAT ACT TCT GGA AAT 5 
Val Arg Asp Cys Glu He He Trp Thr Leu Gin Asp Thr Ser Gly Asn 

150 155 160 

AAG GAA AAT TTA ATT TTT AGG TAT GAA GAA CTT AAT AGG ATA TCT AAT 6< 
Lys Glu Asn Leu He Phe Arg Tyr Glu Glu Leu Asn Arg He Ser Asn 

165 170 175 

TAT ATA AAT AAA TGG ATT TTT GTA ACT ATT ACT AAT AAT AGA TTA GGC 6 

Tyr He Asn Lys Trp He Phe Val Thr He Thr Asn Asn Arg Leu Glv 

180 185 190 ^ 

AAT TCT AGA ATT TAC ATC AAT GGA AAT TTA ATA GTT GAA AAA TCA ATT 7 

Asn Ser Arg He Tyr lie Asn Gly Asn Leu He Val Glu Lys Ser He 
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10 



20 



30 



40 



45 



50 



60 



200 205 210 

TCG AAT TTA GGT GAT ATT CAT GTT AGT GAT AAT ATA TTA TTT AAA ATT 788 
Ser Asn Leu Gly Asp lie His Val Ser Asp Asn lie Leu Phe Lys lie 
215 220 225 

GTT GGT TGT GAT GAT GAA ACG TAT GTT GGT ATA AGA TAT TTT AAA GTT 836 
Val Gly Cys Asp Asp Glu Thr Tyr Val Gly lie Arg Tyr Phe Lys Val 
230 235 240 

TTT AAT ACG GAA TTA GAT AAA ACA GAA ATT GAG ACT TTA TAT AGT AAT 884 
Phe Asn Thr Glu Leu Asp Lys Thr Glu He Glu Thr Leu Tyr Ser Asn 
245 250 255 

GAG CCA GAT CCA AGT ATC TTA AAA AAC TAT TGG GGA AAT TAT TTG CTA 932 
Glu Pro Asp Pro Ser He Leu Lys Asn Tyr Trp Gly Asn Tyr Leu Leu 
260 265 270 275 



TAT AAT AAA AAA TAT TAT TTA TTC AAT TTA CTA AGA AAA GAT AAG TAT 
Tyr Asn Lys Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr 
280 285 290 



980 



ATT ACT CTG AAT TCA GGC ATT TTA AAT ATT AAT CAA CAA AGA GGT GTT 1028 
He Thr Leu Asn Ser Gly He Leu Asn He Asn Gin Gin Arg Gly Val 
295 300 305 

ACT GAA GGC TCT GTT TTT TTG AAC TAT AAA TTA TAT GAA GGA GTA GAA 1076 
Thr Glu Gly Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu 
310 315 320 

GTC ATT ATA AGA AAA AAT GGT CCT ATA GAT ATA TCT AAT ACA GAT AAT 1124 
Val He He Arg Lys Asn Gly Pro He Asp He Ser Asn Thr Asp Asn 
325 330 335 

3.^ TTT GTT AGA AAA AAC GAT CTA GCA TAC ATT AAT GTA GTA GAT CGT GGT 1172 

Phe Val Arg Lys Asn Asp Leu Ala Tyr He Asn Val Val Asp Arg Gly 
340 345 350 355 



GTA GAA TAT CGG TTA TAT GCT GAT ACA AAA TCA GAG AAA GAG AAA ATA 1220 
Val Glu Tyr Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys He 
360 365 370 

ATA AGA ACA TCT AAT CTA AAC GAT AGC TTA GGT CAA ATT ATA GTT ATG 1268 
He Arg Thr Ser Asn Leu Asn Asp Ser Leu Gly Gin He He Val Met 
375 380 385 

GAT TCA ATA GGA AAT AAT TGC ACA ATG AAT TTT CAA AAC AAT AAT GGG 1316 
Asp Ser He Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn Gly 
390 395 400 

AGC AAT ATA GGA TTA CTA GGT TTT CAT TCA AAT AAT TTG GTT GCT AGT 13 64 

Ser Asn He Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala Ser 
405 410 415 

^5 AGT TGG TAT TAT AAC AAT ATA CGA AGA AAT ACT AGC AGT AAT GGA TGC 1412 

Ser Trp Tyr Tyr Asn Asn He Arg Arg Asn Thr Ser Ser Asn Gly Cys 
420 425 430 435 



TTT TGG AGT TCT ATT TCT AAA GAG AAT GGA TGG AAA GAA TGAAAGCTT 146 0 

Phe Trp Ser Ser He Ser Lys Glu Asn Gly Trp Lys Glu 
440 445 

{2) INFORMATION FOR SEQ ID NO:73: 

65 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



70 



(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

Met Gly His His His His His His His His His His Ser Ser Gly His 

5 10 



15 



He Glu Gly Arg His Met Ala Ser Met Ala He Leu He He Tyr Phe 
20 25 30 

Asn Arg Leu Tyr Lys Lys He Lys Asp Ser Ser lie Leu Asp Met Arg 
3S 40 45 

Tyr Giu Asn Asn Lys Phe lie Asp He Ser Gly Tyr Gly Ser A,sn He 



60 



Ser He Asn Giy Asn Val Tyr He Tyr Ser Thr Asn Arg Asn Gin Phe 

75 80 

Gly lie Tyr Asn Ser Arg Leu Ser Glu Val Asn He Ala Gin Asn Asn 
85 90 95 

Asp He lie Tyr Asn Ser Arg Tyr Gin Asn Phe Ser He Ser Phe Trp 
100 105 110 ^ 

val Arg He Pro Lys His Tyr Lys Pro Met Asn His Asn Arg Glu Tyr 

120 125 

Thr He He Asn Cys Met Gly Asn Asn Asn Ser Gly Trp Lyo He Ser 
1-^0 135 f y xe 

Leu Arg Thr Val Arg Asp Cys Glu He He Trp Thr Leu Gin Asp Thr 

ISO 155 

Ser Gly Asn Lys Glu Asn Leu He Phe Arg Tyr Glu Glu Leu Asn Arq 
165 170 

He Ser Asn Tyr He Asn Lys Trp He Phe Val Thr He Thr Asn Acn 
180 las 190 

Arg Leu Gly Asn Ser Arg He Tyr He Asn Gly Asn Leu He Val Glu 
195 200 205 

Lys Ser He Ser Asn Leu Gly Asp He His Val Ser Aap Asn He Leu 

215 220 

Phe Lyo He Val Gly Cys Asp Asp Glu Thr Tyr Val Gly He Arg Tvr 
'25 230 235 " 240 

Phe Lys Val Phe Asn Thr Glu Leu Asp Lys Thr Glu He Glu Thr Leu 
245 250 255 

Tyr ser Asn Glu Pro Asp Pro Ser He Leu Lys Asn Tyr Trp Gly Asn 
260 265 270 

Tyr Leu Leu Tyr Asn Lys Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys 

275 280 285 

Asp Lys Tyr He Thr Leu Asn Ser Gly He Leu Asn He Asn Gin Gin 
290 295 300 

Arg Gly Val Thr Glu Gly Ser Val Phe Leu Asn Tyr Lys Leu Tvr Glu 

310 315 ^ 320 

Gly Val Glu Val He He Arg Lys Asn Gly Pro He Asp He Ser Asn 
325 330 335 

Thr Asp Asn Phe Val Arg Lys Asn Asp Leu Ala Tyr He Asn Val Val 
340 345 350 

Asp Arg Gly Val Glu Tyr Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys 
355 360 365 
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Glu Lys lie He Arg Thr Ser Asn Leu Asn Asp Ser Leu Gly Gin He 
370 375 380 

He Val Met Asp Ser He Gly Asn Asn Cys Thr Met Asn Phe Gin Asn 
385 390 395 400 

Asn Asn Gly Ser Asn He Gly Leu Leu Gly Phe His Ser Asn Asn Leu 
405 410 415 

Val Ala Ser Ser Trp Tyr Tyr Asn Asn He Arg Arg Asn Thr Ser Ser 
420 425 430 

Asn Gly Cys Phe Trp Ser Ser He Ser Lys Glu Asn Gly Trp Lys Glu 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

; SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

CGCCATGGCT ATTCTAATTA TATATTTTAA TAG 

(2) INFORMATION FOR SEQ ID N0:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc ^ "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

GCAAGCTTTC ATTCTTTCCA TCCATTCTC 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION; l.,3891 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

ATG CCA GTT AAT ATA AAA AAC TTT AAT TAT AAT GAC CCT ATT AAT AAT 
Met Pro Val Asn He Lys Asn Phe Asn Tyr Asn Asp Pro He Asn Asn 
15 10 15 

GAT GAC ATT ATT ATG ATG GAA CCA TTC AAT GAC CCA GGG CCA GGA ACA 
Asp Asp He He Met Met Glu Pro Phe Asn Asp Pro Gly Pro Glv Thr 
20 25 30 

TAT TAT AAA GCT TTT AGG ATT ATA GAT CGT ATT TGG ATA GTA CCA GAA 
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20 



40 



55 



60 



Tyr Tyr Lys Ala Phe Arg He He Asp Arg He Trp lie Val Pro Glu 
35 40 45 

IT I^"^ """^ c:aa ttt aat gcc agt aca gga 

Arg Phe Thr Tyr Gly Phe Gin Pro Asp Gin Phe Asn Ala Ser Thr Gly 

=° 55 60 

SIT ir c^"^ ^ ^^"^ ^■^^ "^^^ TAT TAC GAT CCA ACT TAT TTA AAA 

Val Phe Ser Lys Asp Val Tyr Glu Tyr Tyr Asp Pro Thr Tyr Leu Lys 

■'0 75 BO 

ih^ t'^'^ »r Sf* ^ ^^"^ ^ ■^■^■^ ■^^'^ ^ ACA ATG ATT AAA TJA TTT 
Thr Asp Ala Glu Lys Asp Lys Phe Leu Lys Thr Met He Lys Leu Phe 

85 90 95 

AAT AGA ATT AAT TCA AAA CCA TCA GGA CAG AGA TTA CTG GAT ATG ATA 
Asn Arg He Asn Ser Lys Pro Ser Gly Gin Arg Leu Leu Asp Met He 
100 105 

CTA GAT GCT ATA CCT TAT CTT GGA AAT GCA TCT ACA CCG CCC GAC AAA 
Val Asp Ala He Pro Tyr Leu Gly Asn Ala Ser Thr Pro Pro Asp Lys 
lis 120 125 



235 240 



Tyr Asn Lys Ala Leu Gin Asn Phe Gin Asp lie Ala Asn Arg Leu Asn 
290 295 300 

ATT GTT TCA AGT GCC CAA GGG AGT GGA ATT GAT ATT TCC TTA TAT AAA 

Tl^ W-,1 r.^- - 

315 320 

368 . 



70 305 21Q '^^'^ 



192 



240 



268 



336 



384 



->S D^I n?'^ *^'^T ^™ '^^'^ '^'TT AAT AAA AAA ATT ATC CAA 4 32 

" ^^"^ AS'^ He He Gin 

•-*0 135 140 



n ^^'^ ^ GG'T "A ATG ACA AAT TTA ATA ATA 

•50 ^ ^-^^ ^y'^ ^^y ^eu Met Thr Asn Leu lie He 

150 160 

TTT GGA CCA GGA CCA GTT CTA AGT GAT AAT TTT ACT GAT AGT ATG ATT 
Phe Gly Pro Gly Pro Val Leu Ser Asp Asn Phe Thr Asp Ser Met He 
35 170 175 

ATG AAT GGC CAT TCC CCA ATA TCA GAA GGA TTT GGT GCA AGA ATG ATG 
Met Asn ^ly His Ser Pro He Ser Glu Gly Phe Gly Ala Arg Met Met 
180 185 190 

ATA AGA TTT TGT CCT AGT TGT TTA AAT GTA TTT AAT AAT GTT CAG GAA 
He Arg Phe Cys Pro Ser Cys Leu Asn Val Phe Asn Asn Val Gin Glu 
195 200 205 

AAT AAA GAT ACA TCT ATA TTT AGT AGA CGC GCG TAT TTT GCA GAT CCA 6 7: 

Asn Lys Asp Thr Ser He Phe Ser Arg Arg Ala Tvr Phe Ala Asp Pro 
210 215 220 

GCT CTA ACG TTA ATG CAT GAA CTT ATA CAT GTG TTA CAT GGA TTA TAT 

<i\ "^^^ ^eu He His Val Leu His Gly Leu Tyr 
r*U 225 230 



480 



528 



576 



624 



720 



GGA ATT AAG ATA AGT AAT TTA CCA ATT ACT CCA AAT ACA AAA GAA TTT 768 

Gly He Lys He Ser Asn Leu Pro He Thr Pro Asn Thr Lys Glu Phe 
245 250 255 

TTC ATG CAA CAT AGC GAT CCT GTA CAA GCA GAA GAA CTA TAT ACA TTC 816 

Phe Met Gin His Ser Asp Pro Val Gin Ala Glu Glu Leu Tvr Thr Phe 
260 265 270 

GGA GGA CAT GAT CCT AGT GTT ATA AGT CCT TCT ACG GAT ATG AAT ATT 864 

Gly Gly His Asp Pro Ser Val He Ser Pro Ser Thr Asp Met Asn He 

275 280 285 

TAT AAT AAA GCG TTA CAA AAT TTT CAA GAT ATA GCT AAT AGG CTT AAT 91'^ 

o:> Tvr Asn Lys Ala Leu Gin Asn Phe Gin Asp He Ala — 

290 295 300 
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CAA ATA TAT AAA AAT AAA TAT GAT TTT GTT GAA GAT CCT AAT GGA AAA 1008 
Gin He Tyr Lys Asn Lys Tyr Asp Phe Val Glu Asp Pro Asn Gly Lys 
325 330 335 

TAT AGT GTA GAT AAG GAT AAG TTT GAT AAA TTA TAT AAG GCC TTA ATG 1056 
Tyr Ser Val Asp Lys Asp Lys Phe Asp Lys Leu Tyr Lys Ala Leu Met 
340 345 350 

TTT GGC TTT ACT GAA ACT AAT CTA GCT GGT GAA TAT GGA ATA AAA ACT 1104 
Phe Gly Phe Thr Glu Thr Asn Leu Ala Gly Glu Tyr Gly He Lvs Thr 
355 360 365 

AGG TAT TCT TAT TTT AGT GAA TAT TTG CCA CCG ATA AAA ACT GAA AAA 1152 
Arg Tyr Ser Tyr Phe Ser Glu Tyr Leu Pro Pro He Lys Thr Glu Lvs 
>^ 370 375 380 



10 



20 



25 



30 



TTG TTA GAC AAT ACA ATT TAT ACT CAA AAT GAA GGC TTT AAC ATA GCT 1200 
Leu Leu Asp Asn Thr He Tyr Thr Gin Asn Glu Gly Phe Asn He Ala 
385 390 395 400 

AGT AAA AAT CTC AAA ACG GAA TTT AAT GGT CAG AAT AAG GCG GTA AAT 124 8 

Ser Lys Asn Leu Lys Thr Glu Phe Asn Gly Gin Asn Lys Ala Val Asn 
405 410 415 

AAA GAG GCT TAT GAA GAA ATC AGC CTA GAA CAT CTC GTT ATA TAT AGA 1296 
Lys Glu Ala Tyr Glu Glu He Ser Leu Glu His Leu Val He Tyr Arg 
420 425 430 

ATA GCA ATG TGC AAG CCT GTA ATG TAC AAA AAT ACC GGT AAA TCT GAA 1344 
He Ala Met Cys Lys Pro Val Met Tyr Lys Asn Thr Gly Lys Ser Glu 
435 440 445 

CAG TGT ATT ATT GTT AAT AAT GAG GAT TTA TTT TTC ATA GCT AAT AAA 13 92 

Gin Cys He He Val Asn Asn Glu Asp Leu Phe Phe He Ala Asn Lys 
450 455 460 

GAT AGT TTT TCA AAA GAT TTA GCT AAA GCA GAA ACT ATA GCA TAT AAT 144 0 

Asp Ser Phe Ser Lys Asp Leu Ala Lys Ala Glu Thr He Ala Tyr Asn 
465 470 475 480 

ACA CAA AAT AAT ACT ATA GAA AAT AAT TTT TCT ATA GAT CAG TTG ATT 1488 
Thr Gin Asn Asn Thr He Glu Asn Asn Phe Ser He Asp Gin Leu He 
485 490 495 

45 TTA GAT AAT GAT TTA AGC AGT GGC ATA GAC TTA CCA AAT GAA AAC ACA 1536 

Leu Asp Asn Asp Leu Ser Ser Gly He Asp Leu Pro Asn Glu Asn Thr 
500 505 510 



40 



50 



60 



70 



GAA CCA TTT ACA AAT TTT GAC GAC ATA GAT ATC CCT GTG TAT ATT AAA 1584 
Glu Pro Phe Thr Asn Phe Asp Asp He Asp He Pro Val Tyr He Lys 

515 520 525 



CAA TCT GCT TTA AAA AAA ATT TTT GTG GAT GGA GAT AGC CTT TTT GAA 1632 
Gin Ser Ala Leu Lys Lys He Phe Val Asp Gly Asp Ser Leu Phe Glu 
530 535 540 



TAT TTA CAT GCT CAA ACA TTT CCT TCT AAT ATA GAA AAT CTA CAA CTA 1680 
Tyr Leu His Ala Gin Thr Phe Pro Ser Asn He Glu Asn Leu Gin Leu 
545 550 555 560 

ACG AAT TCA TTA AAT GAT GCT TTA AGA AAT AAT AAT AAA GTC TAT ACT 1728 
Thr Asn Ser Leu Asn Asp Ala Leu Arg Asn Asn Asn Lys Val Tyr Thr 
565 570 575 

65 TTT TTT TCT ACA AAC CTT GTT GAA AAA GCT AAT ACA GTT GTA GGT GCT 17 76 

Phe Phe Ser Thr Asn Leu Val Glu Lys Ala Asn Thr Val Val Glv Ala 
580 585 590 



TCA CTT TTT GTA AAC TGG GTA AAA GGA GTA ATA GAT GAT TTT ACA TCT 1824 
Ser Leu Phe Val Asn Trp Val Lys Gly Val He Asp Asp Phe Thr Ser 
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600 605 

GlJ III ThK l^ ^ c^"^ ^5"^ ^'^^ ^'^^ ^ ^'^'^ 'TCA GAT GTA TCC ATA 
Ser Thr Gin Lys Ser Thr He Asp Lys Val ser Asp Vai Ser He 

615 620 

ATT ATT CCC TAT ATA GGA CCT GCT TTG AAT GTA GGA AAT GAA ACA GCT 
lie lie Pro Tyr He Gly Pro Ala Leu Asn Val Gly Thr All 



630 635 



640 



1872 



1920 



AAA GAA AAT TTT AAA AAT GCT TTT GAA ATA GGT GGA GCC GCT ATC TTA l96fl 
Lys Glu Asn Phe Lys Asn Ala Phe Glu He Gly Gly Ala Ala He Leu 
645 

mIh ir ^'^ ^'^ GTA CCT ATA GTT GGA TTT TTT ACA 

Met Glu Phe He Pro Glu Leu He Val Pro He Val Gly Phe Vhe ?hr 

665 670 

TTA GAA TCA TAT GTA GGA AAT AAA GGG CAT ATT ATT ATG ACG ATA TCC 
Leu Glu ser Tyr Val Gly Asn Lys Gly His He He Met Thr He Ser 
675 680 685 

Til ^ ^"""^ ^ "^^^ ^™ ATG TAT GGT TTG 

690 ill "^^P "^^^ "^y^* Leu 

v^^^ l^^ "^^^ °" AAT ACT CAA TTT TAT ACA ATA 

lie Val ser Gin Trp Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He 
^ "^10 715 720 

^ r^. t'^? l^^ '^'^A CAA TCA CAA GCA ATA GAA 

Lys Glu Arg Met Tyr Asn Ala Leu Asn Asn Gin Ser Gin Ala He Glu 
725 730 735 

AAA ATA ATA GAA GAT CAA TAT AAT AGA TAT AGT GAA GAA GAT AAA ATG 
Lys He He Glu Asp Gin Tyr Asn Arg Tyr Ser Glu Glu Asp Lys Met 
■^40 745 750 

AAT ATT AAC ATT GAT TTT AAT GAT ATA GAT TTT AAA CTT AAT CAA AGT 
Asn He Asn He Asp Phe Asn Asp He Asp Phe Lys Leu Asn Gin Ser 
^55 760 765 

ATA AAT TTA GCA ATA AAC AAT ATA GAT GAT TTT ATA AAC CAA TGT TCT 
He Asn Leu Ala He Asn Asn He Asp Asp Phe He Asn Gin Cys Ser 

775 780 

ATA TCA TAT CTA ATG AAT AGA ATG ATT CCA TTA GCT GTA AAA AAG TTA 
He Ser Tyr Leu Met Asn Arg Met He Pro Leu Ala Val Lys Lys Leu 

790 795 800 

AAA GAC TTT GAT GAT AAT CTT AAG AGA GAT TTA TTG GAG TAT ATA GAT 
Lys Asp Phe Asp Asp Asn Leu Lys Arg Asp Leu Leu Glu Tyr He Asd 
805 810 815 

ACA AAT GAA CTA TAT TTA CTT GAT GAA GTA AAT ATT CTA AAA TCA AAA 
Thr Asn Glu Leu Tyr Leu Leu Asp Glu Val Asn He Leu Lys Ser Lys 
820 825 830 

GTA AAT AGA CAC CTA AAA GAC AGT ATA CCA TTT GAT CTT TCA CTA TAT 
Val Asn Arg His Leu Lys Asp Ser He Pro Phe Asp Leu Ser Leu Tvr 
8^5 840 845 

ACC AAG GAC ACA ATT TTA ATA CAA GTT TTT AAT AAT TAT ATT AGT AAT 
Thr Lys Asp Thr He Leu He Gin Val Phe Asn Asn Tyr He Ser Asn 
850 855 860 

ATT AGT AGT AAT GCT ATT TTA AGT TTA AGT TAT AGA GGT GGG CGT TTA 
lie Ser Ser Asn Ala He Leu Ser Leu Ser Tyr Arg Gly Gly Arg Leu 
865 870 875 880 



2016 



2064 



2112 



2160 



2208 



>256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 
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ATA GAT TCA TCT GGA TAT GOT GCA ACT ATG AAT GTA GGT TCA GAT GTT 
He Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn Val Gly Ser Asp Val 
985 890 895 

?Tf ir ^^'^ ^T^ ^^'^ AAA TTA AAT AAT TCT GAA 

He Phe Asn Asp He Gly Asn Gly Gin Phe Lys Leu Asn Asn Ser Glu 
900 905 9X0 

AAT AGT AAT ATT ACG GCA CAT CAA AGT AAA TTC GTT GTA TAT GAT AGT 
Asn Ser Asn He Thr Ala His Gin Ser Lys Phe Val Val Tyr Asp Ser 
315 920 

ATG TTT GAT AAT TTT AGC ATT AAC TTT TGG GTA AGG ACT CCT AAA TAT 
Met Phe Asp Asn Phe Ser lie Asn Phe Trp Val Arg Thr Pro ^ T^r 

^^"^ ^^'^ '^^'^ ^AA AAT GAG TAT ACA ATA ATT 

Asn Asn Asn Asp He Gin Thr Tyr Leu Gin Asn Glu Tyr Thr He lie 

950 955 

r^I tT^ ^ "^^^ '^^^ ^ GTA TCT ATT AAG GGA AAT 

Ser Cys He Lys Asn Asp Ser Gly Trp Lys Val Ser He Lys Gly Asn 
965 970 

Arc lit .P^^ ^^'^ ^T"" ^^"^ °" ^'^ ^ TCA ATA 

Arg He lie Trp Thr Leu He Asp Val Asn Ala Lys Ser Lys Ser He 
960 

'HI 11^ '^^'^ ^ AAT ATA TCA GAT TAT ATA AAT AAA 

Phe Phe Glu Tyr Ser He Lys Asp Asn He Ser Asp Tyr He Asn Lys 
995 1000 1005 

^ro III c"^^ tT^" ""^"^ ^T GAT AGA TTA GGT AAC GCA AAT ATT 

^ fn?.^"" Thr Asn Asp Arg Leu Gly Asn Ala Asn He 

^^10 1015 1020 

TAT ATA AAT GGA AGT TTG AAA AAA AGT GAA AAA ATT TTA AAC TTA GAT 
Tyr lie Asn Gly Ser Leu Lys Lys Ser Glu Lys He Leu Asn Leu Asp 
^^^^ 1030 1035 1040 

AGA ATT AAT TCT AGT AAT GAT ATA GAC TTC AAA TTA ATT AAT TGT ACA 
Arg He Asn Ser Ser Asn Asp He Asp Phe Lys Leu He Asn Cys Thr 
1045 1050 1055 

GAT ACT ACT AAA TTT GTT TGG ATT AAG GAT TTT AAT ATT TTT GGT AGA 
^sp Tnr Thr Lys Phe Val Trp He Lys Asp Phe Asn He Phe Glv Arg 
1060 1065 1070 ' 

GAA TTA AAT GCT ACA GAA GTA TCT TCA CTA TAT TGG ATT CAA TCA TC^ 
Glu Leu Asn Ala Thr Glu Val Ser Ser Leu Tyr Trp He Gin Ser Ser 
1075 1080 1085 

ThO ^^'^ 7™ ^ ^^'^ '^'^ ^T CCT TTA AGA TAC GAT ACA 

^narJ^"" ^^"^ Trp Gly Asn Pro Leu Arg Tyr Asp Thr 

1090 1095 1100 

CAA TAC TAT CTG TTT AAT CAA GGT ATG CAA AAT ATC TAT ATA AAG TAT 
Gin Tyr Tyr Leu Phe Asn Gin Gly Met Gin Asn He Tyr He Lys Tyr 
1105 1110 1115 ^^20 

TTT AGT AAA GCT TCT ATG GGG GAA ACT GCA CCA CGT ACA AAC TTT AAT 
Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro Arg Thr Asn Phe Asn 
1125 1130 ii3ji 



2688 



2736 



2784 



2832 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



3264 



3312 



3360 



3408 
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85 



90 



95 



Asn Arg He Asn Ser Lys Pro Ser Gly Gin Arg Leu Leu Asp Met He 
100 105 110 

Val Asp Ala He Pro Tyr Leu Gly Asn Ala Ser Thr Pro Pro Asp Lys 
115 120 125 

Phe Ala Ala Asn Val Ala Asn Val Ser He Asn Lys Lys He He Gin 
130 135 140 

Pro Gly Ala Glu Asp Gin He Lys Gly Leu Met Thr Asn Leu X.ie He 

150 155 160 

Phe Gly Pro Gly Pro Val Leu Ser Asp Asn Phe Thr Asp Ser Met He 
165 170 175 

Met Asn Gly His Ser Pro He Ser Glu Gly Phe Gly Ala Arg Met Met 
180 185 190 

He Arg Phe Cys Pro Ser Cys Leu Asn Val Phe Asn Asn Val Gin Glu 
195 200 205 

Asn Lys Asp Thr Ser He Phe Ser Arg Arg Ala Tyr Phe Ala Asp Pro 
210 215 220 

Ala Leu Thr Leu Met His Glu Leu He His Val Leu His Gly Leu Tyr 
225 230 235 240 

Gly He Lys He Ser Asn Leu Pro He Thr Pro Asn Thr Lys Glu Phe 
245 250 255 

Phe Met Gin His Ser Asp Pro Val Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

Gly Gly His Asp Pro Ser Val He Ser Pro Ser Thr Asp Met Asn He 
275 280 285 

Tyr Asn Lys Ala Leu Gin Asn Phe Gin Asp He Ala Asn Arg Leu Asn 
290 295 300 

He Val Ser Ser Ala Gin Gly Ser Gly He Asd He Ser Leu Tyr Lvs 
305 310 315 320 

Gin He Tyr Lys Asn Lys Tyr Asp Phe Val Glu Asp Pro Asn Gly Lys 
325 330 335 

Tyr Ser Val Asp Lys Asp Lys Phe Asp Lys Leu Tyr Lys Ala Leu Met 
340 345 350 

Phe Gly Phe Thr Glu Thr Asn Leu Ala Gly Glu Tyr Gly He Lys Thr 
355 360 365 

Arg Tyr Ser Tyr Phe Ser Glu Tyr Leu Pro Pro He Lys Thr Glu Lys 

370 375 380 

Leu Leu Asp Asn Thr He Tyr Thr Gin Asn Glu Gly Phe Asn He Ala 
385 390 395 400 

Ser Lys Asn Leu Lys Thr Glu Phe Asn Gly Gin Asn Lys Ala Val Asn 
405 410 415 

Lys Glu Ala Tyr Glu Glu He Ser Leu Glu His Leu Val He Tyr Arg 
420 425 430 

He Ala Met Cys Lys Pro Val Met Tyr Lys Asn Thr Gly Lys Ser Glu 
435 440 445 

Gin Cys He He Val Asn Asn Glu Asp Leu Phe Phe He Ala Asn Lys 
450 455 460 
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Asp Ser Phe Ser Lys Asp Leu Ala Lys Ala Glu Thr He Ala Tyr Asn 

^''0 475 480 

Thr Gin Asn Asn Thr He Glu Asn Asn Phe Ser He Asp Gin Leu He 
485 

Leu Asp Asn Asp Leu Ser Ser Gly He Asp Leu Pro Asn Glu Asn Thr 
500 505 510 

Glu Pro Phe Thr Asn Phe Asp Asp He Asp He Pro Vai Tyr He Lys 
515 520 525 ^ 

Gin Ser Ala Leu Lys Lys He Phe Val Asp Gly Asp Ser Leu Phe Glu 
3-»0 535 540 

Tyr Leu His Ala Gin Thr Phe Pro Ser Asn He Glu Asn Leu Gin Leu 

550 555 560 

Thr Asn Ser Leu Asn Asp Ala Leu Arg Asn Asn Asn Lys Val Tyr Thr 
565 570 575 

Phe Phe Ser Thr Asn Leu Val Glu Lys Ala Asn Thr Val Val Glv Ala 
5B0 585 590 

ser Leu Phe Val Asn Trp Val Lys Gly Val He Asp Asp Phe Thr Ser 
595 600 605 

Glu ser Thr Gin Lys Ser Thr He Asp Lys Val Ser Asp Val Ser He 
^-^0 615 620 

He He Pro Tyr He Gly Pro Ala Leu Asn Val Gly Asn Glu Thr Ala 
^25 630 635 640 

Lys Glu Asn Phe Lys Asn Ala Phe Glu He Gly Gly Ala Ala He Leu 
6^5 650 655 

Met Glu Phe He Pro Glu Leu He Val Pro He Val Gly Phe Phe Thr 
660 665 670 

Leu Glu Ser Tyr Val Gly Asn Lys Gly His He He Met Thr He Ser 
675 680 685 

Asn Ala Leu Lys Lys Arg Asp Gin Lys Trp Thr Asp Met Tyr Gly Leu 
690 695 700 

He Val Ser Gin Trp Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He 
"^^5 710 715 720 

Lys Glu Arg Met Tyr Asn Ala Leu Asn Asn Gin Ser Gin Ala He Glu 
725 730 735 

Lys He He Glu Asp Gin Tyr Asn Arg Tyr Ser Glu Glu Asp Lys Met 
740 745 750 

Asn He Asn He Asp Phe Asn Asp He Asp Phe Lys Leu Asn Gin Ser 
755 760 765 

He Asn Leu Ala He Asn Asn He Asp Asp Phe He Asn Gin Cys Ser 
770 775 780 

He Ser Tyr Leu Met Asn Arg Met He Pro Leu Ala Val Lys Lys Leu 
785 790 795 800 

Lys Asp Phe Asp Asp Asn Leu Lys Arg Asp Leu Leu Glu Tyr He Asp 
805 810 815 

Thr Asn Glu Leu Tyr Leu Leu Asp Glu Val Asn He Leu Lys Ser Lys 
820 825 830 

Val Asn Arg His Leu Lys Asp Ser He Pro Phe Asp Leu Ser Leu Tyr 
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835 840 845 

Thr Lys Asp Thr lie Leu He Gin Val Phe Asn Asn Tyr He Ser Asn 
850 855 860 

He Ser Ser Asn Ala He Leu Ser Leu Ser Tyr Arg Gly Gly Arg Leu 
865 870 875 880 

He Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn Val Gly Ser Asp Val 
885 890 895 

He Phe Asn Asp He Gly Asn Gly Gin Phe Lys Leu Asn Asn S^r Glu 
900 905 910 

Asn Ser Asn He Thr Ala His Gin Ser Lys Phe Val Val Tvr Asp Ser 
915 920 925 

Met Phe Asp Asn Phe Ser He Asn Phe Trp Val Arg Thr Pro Lys Tyr 
930 935 940 

Asn Asn Asn Asp He Gin Thr Tvr Leu Gin Asn Glu Tyr Thr He He 
945 950 955 960 

Ser Cys He Lys Asn Asp Ser Gly Trp Lys Val Ser He Lys Gly Asn 
965 970 975 

Arg He He Trp Thr Leu He Asp Val Asn Ala Lys Ser Lvs Ser He 
980 985 990 

Phe Phe Giu Tyr Ser He Lys Asp Acn He Ser Asp Tyr He Asn Lys 
995 1000 1005 

Trp Phe Ser He Thr He Thr Asn Asp Arg Leu Gly Asn Ala Asn He 
'••010 1015 1020 

Tyr He Asn Gly Ser Leu Lys Lys Ser Glu Lys He Leu Asn Leu Asp 
1025 1030 1035 1040 

Arg Tie Asn Ser Ser Asn Asp He Asp Phe Lys Leu He Asn Cys Thr 
1045 1050 1055 

Asp Thr Thr Lys Phe Val Trp He Lys Asp Phe Asn He Phe Gly Arg 
1060 1065 1070 

Giu Leu Asn Aia Thr Glu Val Ser Ser Leu Tyr Trp He Gin Ser Ser 
1075 1080 1085 

Thr Asn Thr Leu Lys Asp Phe Trp Gly Asn Pro Leu Arg Tyr Asp Thr 
1090 1095 1100 

Gin Tyr Tyr Leu Phe Asn Gin Gly Met Gin Asn He Tyr He Lys Tyr 
il05 1110 1115 1120 

Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro Arg Thr Acn Phe Asn 
H25 1130 1135 

Asn Aia Ala He Asn Tyr Gin Asn Leu Tyr Leu Gly Leu Arg Phe He 
1140 1145 1150 

He Lys Lys Ala Ser Asn Ser Arg Asn He Asn Asn Asp Asn He Val 
1155 1160 1165 

Arg Glu Gly Asp Tyr He Tyr Leu Asn He Asp Asn He Ser Asp Glu 
1170 1175 1180 

Ser Tyr Arg Val Tyr Val Leu Val Asn Ser Lys Glu He Gin Thr Gin 
1185 1190 1195 1200 

Leu Phe Leu Ala Pro He Asn Asp Asp Pro Thr Phe Tvr Asp Val Leu 
1205 1210 ' 12X5 
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Gin He Lys Lys Tyr Tyr Glu Lys Thr Thr Tyr Asn Cys Gin lie Leu 
^220 1225 1230 

Cys Glu Lys Asp Thr Lys Thr Phe Gly Leu Phe Gly He Gly Lys Phe 
1235 1240 1245 

Val i-ys Asp Tyr Gly Tyr Val Trp Asp Thr Tyr Asp Asn Tyr Phe Cys 
1250 1255 



1260 



lie Ser Gin Trp Tyr Leu Arg Arg He Ser Glu Asn He Asn Lys Leu 
--^^ 1270 1275 1280 

Arg Leu Gly Cys Asn Trp Gin Phe He Pro Val Asp Glu Gly Trp Thr 
1285 1290 1295 

Glu 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

■ii) MOLECULE TYPE: DNA (genomic) 

(ixj FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 108.. 1526 

(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 6 0 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 16 4 

His His His His His Uis His His His Ser Ser Gly His He Glu Gly 
5 10 15 

CGT CAT ATG GCT AGC ATG GCT GAC ACA ATT TTA ATA CAA GTT TTT AAT -^1-^ 
Arg His Met Ala Ser Met Ala Asp Thr He Leu He Gin Val Phe Asn 
-0 25 30 35 

AAT TAT ATT AGT AAT ATT AGT AGT AAT GCT ATT TTA AGT TTA AGT TAT 260 
Asn Tyr He Ser Asn He Ser Ser Asn Ala He Leu Ser Leu Ser Tvr 
40 45 50 

AGA GGT GGG CGT TTA ATA GAT TCA TCT GGA TAT GGT GCA ACT ATG AAT 308 
Arg Gly Gly Arg Leu He Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn 
55 60 65 

GTA GGT TCA GAT GTT ATC TTT AAT GAT ATA GGA AAT GGT CAA TTT AAA 356 
Val Gly Ser Asp Val He Phe Asn Asp He Gly Asn Gly Gin Phe Lys 
70 75 80 

TTA AAT AAT TCT GAA AAT AGT AAT ATT ACG GCA CAT CAA AGT AAA TTC 404 
Leu Asn Asn Ser Glu Asn Ser Asn He Thr Ala His Gin Ser Lys Phe 
85 90 95 

GTT GTA TAT GAT AGT ATG TTT GAT AAT TTT AGC ATT AAC TTT TGG GTA 4 5-5 

Val Val Tyr Asp Ser Met Phe Asp Asn Phe Ser He Asn Phe Trp Val 

105 110 115 

AGG ACT CCT AAA TAT AAT AAT AAT GAT ATA CAA ACT TAT CTT CAA AAT 500 
Arg Thr Pro Lys Tyr Asn Asn Asn Asp He Gin Thr Tyr Leu Gin Asn 
120 125 130 
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20 



65 



GAG TAT ACA ATA ATT AGT TGT ATA AAA AAT GAC TCA GGA TGG AAA GTA 
Glu Tyr Thr lie He Ser Cys He Lys Asn Asp Ser Gly Trp Lys Val 
135 140 

TCT ATT AAG GGA AAT AGA ATA ATA TGG ACA TTA ATA GAT GTT AAT GCA 
oer He Lys Gly Asn Arg He Ho Trp Thr Leu He Asp Val Asn Ala 
ISO 155 



G«T TAT ATA AAT AAA TGG TTT TCC ATA ACT ATT ACT AAT GAT AGA TTA 
Asp Tyr He Asn Lys Trp Phe Ser He Thr He Thr Asn Asp Arg Leu 

190 jgg 

GGT AAC GCA AAT ATT TAT ATA AAT GGA AGT TTG AAA AAA AGT GAA AAA 
Gly Asn Ala Asn He Tyr He Asn Gly Ser Leu Lys Lys Ser Glu Lys 
200 205 210 

ATT TTA AAC TTA GAT AGA ATT AAT TCT AGT AAT GAT ATA GAC TTC AAA 
lie Leu Asn Leu Asp Arg He Asn Ser Ser Asn Aop He Asp Phe Lvs 

215 220 225 

TTA ATT AAT TGT ACA GAT ACT ACT AAA TTT GTT TGG ATT AAG GAT TTT 
Leu He Asn Cys Thr Asp Thr Thr Lys Phe Val Trp He Lys Asp Phe 
230 235 240 



AAT GAT AAT ATA GTC AGA GAA GGA GAT TAT ATA TAT CTT AAT ATT GAT 
Asn Asp Asn He Val Arg Glu Gly Asp Tyr He Tyr Leu Asn He Asp 
''0 345 350 

AAT ATT TCT GAT GAA TCT TAC AGA GTA TAT GTT TTG GTG AAT TCT AAA 
Asn He Ser Asp Glu Ser Tyr Arg val Tyr Val Leu Val Asn ser Lys 

365 370 

GAA ATT CAA ACT CAA TTA TTT TTA GCA CCC ATA AAT GAT GAT CCT ACG 
(Uu He Gin Thr Gin Leu Phe Leu Ala Pro He Asn Asp Asp Pro Thr 
375 380 385 



TTC TAT GAT GTA CTA CAA ATA AAA AAA TAT TAT GAA AAA ACA ACA TAT 
Phe Tyr Asp Val Leu Gin He Lys Lys Tyr Tyr Glu Lys Thr Thr Tvr 
390 395 400 ^ 

AAT TGT CAG ATA CTT TGC GAA AAA GAT ACT AAA ACA TTT GGG CTG TTT 
Asn Cys Gin He Leu Cys Glu Lys Asp Thr Lys Thr Phe Gly Leu Phe 
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548 



596 



10 111 rJ'' IT ""^ '"''^ ^^"^ ^■r* ^ GAT AAT ATA TCA 644 

10 Lys ser Lys Ser He Phe Phe Glu Tyr Ser He Lys Asp Asn He Ser 

170 



692 



74 0 



788 



836 



AAT ATT TTT GGT AGA GAA TTA AAT GCT ACA GAA GTA TCT TCA CTA TAT 884 
Asn lie Phe Gly Arg Glu Leu Asn Ala Thr Glu Val Ser Ser Leu Tyr 

250 255 

TGG ATT CAA TCA TCT ACA AAT ACT TTA AAA GAT TTT TGG GGG AAT CCT 
IS ^^"^ ^^"^ '^^^ Leu Lys Asp Phe Trp Gly Asn Pro 

265 270 275 

TTA AGA TAC GAT ACA CAA TAC TAT CTG TTT AAT CAA GGT ATG CAA AAT 
Leu Arg Tyr Asp Thr Gin Tyr Tyr Leu Phe Asn Gin Gly Met Gin Asn 
40 280 285 290 

ATC TAT ATA AAG TAT TTT AGT AAA GCT TCT ATG GGG GAA ACT GCA CCA 
lie Tyr He Lys Tyr Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro 
29B 300 305 

4> CGT ACA AAC TTT AAT AAT GCA GCA ATA AAT TAT CAA AAT TTA TAT CTT 

Arg Thr Asn Phe Asn Asn Ala Ala He Asn Tyr Gin Asn Leu Tvr Leu 

315 320 

GGT TTA CGA TTT ATT ATA AAA AAA GCA TCA AAT TCT CGG AAT ATA AAT 11?4 
^0 Gly Leu Arg Phe lie He Lys Lys Ala Ser Asn Ser Arg Asn lie Asn 
^25 330 335 



932 



980 



1028 



1076 



1172 



1220 



1268 



1316 
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405 410 415 

GGA ATT GGT AAA TTT GTT AAA GAT TAT GGA TAT GTT TGG GAT ACC TAT 1412 
Gly He Gly Lys Phe Val Lys Asp Tyr Gly Tyr Val Trp Asp Thr Tyr 
420 425 430 435 

GAT AAT TAT TTT TGC ATA AGT CAG TGG TAT CTC AG A AG A ATA TCT GAA 1460 
Asp Asn Tyr Phe Cys He Ser Gin Trp Tyr Leu Arg Arg He Ser Glu 
440 445 450 

AAT ATA AAT AAA TTA AGG TTG GGA TGT AAT TGG CAA TTC ATT CCC GTG 1508 
Asn He Asn Lys Leu Arg Leu Gly Cys Asn Trp Gin Phe He Pro Val 
455 460 465 

GAT GAA GGA TGG ACA GAA TAACTCGAG 1535 
Asp Glu Gly Trp Thr Glu 
470 

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
15 10 15 

He Glu Gly Arg His Met Ala Ser Met Ala Asp Thr He Leu He Gin 
20 25 30 

Val Phe Asn Asn Tyr He Ser Asn He Ser Ser Asn Ala He Leu Ser 
35 40 45 

Leu Ser Tyr Arg Gly Gly Arg Leu He Asp Ser Ser Gly Tyr Gly Ala 
50 55 60 

Thr Met Asn Val Gly Ser Asp Val He Phe Asn Asp He Gly Asn Gly 
65 70 75 80 

Gin Phe Lys Leu Asn Asn Ser Glu Asn Ser Asn He Thr Ala His Gin 
85 90 95 

Ser Lys Phe Val Val Tyr Asp Ser Met Phe Asp Asn Phe Ser He Asn 

100 105 110 

Phe Trp Val Arg Thr Pro Lys Tyr Asn Asn Asn Asp He Gin Thr Tyr 
115 120 125 

Leu Gin Asn Glu Tyr Thr He He Ser Cys He Lys Asn Asp Ser Gly 
130 135 140 

Trp Lys Val Ser He Lys Gly Asn Arg He He Trp Thr Leu He Asp 
145 150 155 160 

Val Asn Ala Lys Ser Lys Ser He Phe Phe Glu Tyr Ser He Lys Asp 
165 170 175 

Asn He Ser Asp Tyr He Asn Lys Trp Phe Ser He Thr He Thr Asn 
180 185 190 

Asp Arg Leu Gly Asn Ala Asn He Tyr He Asn Gly Ser Leu Lys Lys 

195 200 205 

Ser Glu Lys He Leu Asn Leu Asp Arg He Asn Ser Ser Asn Asp He 
210 215 220 
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Asp Phe Lys Leu He Asn Cys Thr Asp Thr Thr Lys Phe Val Trp He 
225 230 235 240 

Lys Asp Phe Asn He Phe Gly Arg Glu Leu Asn Ala Thr Glu Val Ser 
245 250 255 

Ser Leu Tyr Trp He Gin Ser Ser Thr Asn Thr Leu Lys Asp Phe Trp 
260 265 270 

Gly Asn Pro Leu Arg Tyr Asp Thr Gin Tyr Tyr Leu Phe Asn Gin Gly 
275 280 285 

Met Gin Asn He Tyr He Lys Tyr Phe Ser Lys Ala Ser Met Gly Glu 
290 295 300 

Thr Ala Pro Arg Thr Asn Phe Asn Asn Ala Ala He Asn Tyr Gin Asn 
305 310 315 320 

Leu Tyr Leu Gly Leu Arg Phe He He Lys Lys Ala Ser Asn Ser Arq 
325 330 335 

Asn He Asn Asn Asp Asn He Val Arg Glu Gly Asp Tyr He Tyr Leu 
340 345 350 

Aon He Asp Asn He Ser Asp Glu Ser Tyr Arg Val Tyr Val Leu Val 
355 360 365 

Asn ser Lys Glu He Gin Thr Gin Leu Phe Leu Ala Pro He Asn Asp 
370 375 380 

Asp Pro Thr Phe Tyr Asp Val Leu Gin He Lys Lys Tyr Tyr Glu Lvs 
3B5 390 395 400 

Thr Thr Tyr Asn Cys Gin He Leu Cys Glu Lys Asp Thr Lys Thr Phe 
405 410 415 

Gly Leu Phe Gly He Gly Lys Phe Val Lys Asp Tyr Gly Tyr Val Trp 
420 425 430 

Asp Thr Tyr Asp Asn Tyr Phe Cys He Ser Gin Trp Tyr Leu Arg Arg 
435 440 445 

He Ser Glu Asn He Asn Lys Leu Arg Leu Gly Cys Asn Trp Gin Phe 
450 455 460 

lie Pro Val Asp Glu Gly Trp Thr Glu 
465 470 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 3 0 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /deSC = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 

CGCCATGGCT GACACAATTT TAATACAAGT 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:81: 

GCCTCGAGTT ATTCTGTCCA TCCTTCATCC AC 32 

(2) INFORMATION FOR SEQ ID NO: 82: 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



iix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

rhic INFORMATION: /note= "The asparagine residue at 

this position contains an amide group." 

^- SEQUENCE DESCRIPTION: SEQ ID NO:82: 

Cys Gin Thr lie Asp Gly Lys Lys Tyr Tyr Phe Asn 
1 5 10 
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CLAIMS 

1 . A host cell containing a recombinant expression vector, said vector encoding a 
protein comprising at least a portion of a Clostridium boudinum toxin, said toxin selected 
Irom the group consisting of type B toxin and type E toxin. 

2. The host cell of Claim I, wherein and said host cell is capable of expressing 
said protein at a level greater than or equal to 5% of the total cellular protein. 

3. The host cell of Claim K wherein and said host cell is capable of expressing 
said protein as a soluble protein at a level greater than or equal to 0.25% of the total soluble 
cellular protein. 

4. The host cell of Claim I, wherein said host cell is an Escherichia coli cell. 

5. The host cell of Claim I. wherein said host cell is an insect cell. 

6. The host cell of Claim I. wherein said host cell is a yeast cell. 

7. A host cell containing a recombinant expression vector, said vector encoding a 
fusion protein comprising a non-toxin protein sequence and at least a portion of a Closiridium 
hoiulimmt loxin, said toxin selected from the group consisting of type B toxin and type H 
toxin. 

8. The host cell of Claim 7. wherein said portion of said toxin comprises the 
receptor binding domain. 

9. The host cell of Claim 7. wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

10. A vaccine comprising a fusion protein, said fusion protein comprising a non- 
toxin protein sequence and at least a portion of a Clostridium hotulinum toxin, said toxin 
selected from the group consisting of type B toxin and type E toxin. 
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11. The vaccine of Claim 10 further comprising a fusion protein comprising a non- 
toxin protein sequence and at least a portion of Clostridium hotulinum type A toxin. 

12. The vaccine of Claim 10, wherein said portion of said Closiridium hotulinum 
toxin comprises the receptor binding domain. 

13. The vaccine of Claim 10 wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

14. The vaccine of Claim 10, wherein said vaccine is substantially endotoxin-free. 

15. A method of generating antibody directed against a Clostridium hotulinum 
toxin comprising: 

a) providing in any order: 

i) an antigen comprising a fusion protein comprising a non-toxin 
protein sequence and at least a portion of a Clostridium hotulinum loxin. said 
toxin selected from the group consisting of type B toxin and type E toxin, and 

ii) a host; and 

b) immunizing said host with said antigen so as to generate an antibody. 

16. The method of Claim 15, wherein said antigen further comprises a fusion 
protein comprising a non-toxin protein sequence and at least a portion of Clostridium 
hotulinum type A toxin. 

1 7. The method of Claim 1 5, wherein said portion of said Clostridium hotulinum 
toxin comprises the receptor binding domain. 

18. The method of Claim 15 wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

19. The method of Claim 15 wherein said host is a mammal. 

20. The method of Claim 19 wherein said mammal is a human. 
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21. The method of Claim 15 further comprising step c) collecting said antibodies 
from said host. 

22. The method of Claim 21 further comprising step d) purifying said antibodies. 

23. The antibody raised according to the method of Claim 15. 

24. The antibody raised according to the method of Claim 16. 
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FIGURE 1 
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FIGURE 2 



1.7 -r 




7-0 6.6 1.3 2.6 5.3 1 x 2x 

XX X X X 10 10 

10 10 10 10 10 -5 -6 

-2 -3 -3 -4 -5 

Dilution of IgY 



wo 98/08540 



PCTAJS97/153M 




wo 98/08540 



PCTAJS97/15394 



FIGURE 4 
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FIGURE 5 




Anti-Tox. B 
Anti-Tox. A-fTox. B 
Pfeimmune IgY 
TocBRef. (0.1|xg/ml) 



10 100 1000 10000 100000 

IgY Dilution Factor 



WO98/08S40 



PCT/OS97/1S394 



FIGURE 6 
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P4=5'CTATCTAGGCCTAAAGTAT3'. Indicated restriction sites in fragments 1 and 2 are internal 
$ite$ used to done into p6EX2T vector (fragment 1 ; construct called p6A30-660) or pMALc 
vector (fragment 2; construct called |^A660-1 100). Bracketed restriction sites at ends of 
fragment 3 are pUC9 poiyfinker sites utilized to clone fragnnent 3 into pET23 vector 
(construct called pPAl 100-2680). Numbers in these constructs refer to toxin A amino add 
interval that is expressed. The shaded portion of the toxin A gene corresponds to the repeating 
iigand binding domain. 
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FIGURE? 
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FIGURE 8 
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FIGURE 9 
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FIGURE 1 1 
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FIGURE 12 
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FIGURE 13 
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FIGURE 14 
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FIGURE 16 
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FIGURE 17 
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Summary of expression constructs from the toxin U gene 



pPBIO-1750 C 



3 



pPBIO-1530 



P5» 



pMBIO-330^ 

pMB260-520^|i^ 

pPBI 0-330 Kw i^ I 
pMB10-47Ul^r 

pPBIO-520 HHh^ 



Affll H()al accI.'booRV Aflll ipel 
^PIO 



Hind 
III 



pMB510-820< 
pMB510-1110< 



pMB820-1110i 



pMB1100-1530< 



3 



pMB1530-1750 
pPBl 530-1 750 HHhfc 
pMB1570-1750 
pM 8 11 DO- 1 750 ^dZi: 



1 



P14 



200aa 



HHH =HIS-TAG 



pMB1850-1970 
PMB1750-1970 

pMB1970-2360 

PMB1750-2360 

pPB1750-2360 

P14^ 



pMB1850'2360< 



PPB1850-2360 



P14^ 



Hindi! f 





*HHH 



wo 98/08540 



PCT/US97/15394 



FIGURE 19 
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FIGURE 20 
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FIGURE 21 
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FIGURE 22 
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FIGURE 23 
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FIGURE 24 
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FIGURE 25 
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FIGURE 26 
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FIGURE 27 
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FIGURE 28 
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FIGURE 29 
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FIGURE 30 



1 2 3 4 5 6 




wo 98/08540 



PCT/US97/I53M 



FIGURE 31 
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FIGURE 32 
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FIGURE 33 
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FIGURE 34 



I g 3 ± S 




wo 98/08540 



PCT/US97/15394 




wo 98/08540 



PCT/US97/153M 



FIGURE 36 



1 2 34 5678 




i 



WO98A)8540 



PCT/US97/15394 



FIGURE 37 



1 2 3 4 5 



37/^0 



wo 98/08540 



PCT/US97/1S394 



FIGURE 38 
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FIGURE 39 
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FIGURE 40 
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